aboutsummaryrefslogtreecommitdiff
path: root/CMakeLists.txt
AgeCommit message (Expand)Author
2023-07-26make : build with -Wmissing-prototypes (#2394)Cebtenzzre
2023-07-21flake : remove intel mkl from flake.nix due to missing files (#2277)wzy
2023-07-19flake : update flake.nix (#2270)wzy
2023-07-19cmake : install targets (#2256)wzy
2023-07-12FP16 is supported in CM=6.0 (#2177)Howard Su
2023-07-10mpi : add support for distributed inference via MPI (#2099)Evan Miller
2023-07-09ggml : fix buidling with Intel MKL but ask for "cblas.h" issue (#2104) (#2115)clyang
2023-07-05Quantized dot products for CUDA mul mat vec (#2067)Johannes Gäßler
2023-07-04Simple webchat for server (#1998)Tobias Lütke
2023-07-01cmake : don't force -mcpu=native on aarch64 (#2063)Daniel Drake
2023-06-26k-quants : support for super-block size of 64 (#2001)Kawrakow
2023-06-21cmake: revert CUDA arch default to 52, 61 if f16 (#1959)Johannes Gäßler
2023-06-19cmake : fix trailing whitespacesGeorgi Gerganov
2023-06-19cmake : fix build shared ggml when CUDA is enabled (#1929)Howard Su
2023-06-19Convert vector to f16 for dequantize mul mat vec (#1913)Johannes Gäßler
2023-06-18cmake : add CUDA_ARCHITECTURES to new target ggml_static (#1917)Howard Su
2023-06-17Allow cmake to build ggml as a library (#1896)Kerfuffle
2023-06-16cmake : add auto detection of BLAS_INCLUDE_DIRS (#1886)Zenix
2023-06-16CUDA : faster k-quant dot kernels (#1862)Kawrakow
2023-06-15cmake : remove whitespacesGeorgi Gerganov
2023-06-15cmake : set include path for OpenBlas (#1830)Igor Okulist
2023-06-10cmake : fix Metal build (close #1791)Georgi Gerganov
2023-06-10metal : fix issue with ggml-metal.metal path. Closes #1769 (#1782)Andrei
2023-06-08k-quants : add missing compile definition to CMakeLists (#1748)johnson442
2023-06-07k-quants : allow to optionally disable at compile time (#1734)Georgi Gerganov
2023-06-05ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)Kawrakow
2023-06-04llama : Metal inference (#1642)Georgi Gerganov
2023-05-27[CI] Fix openblas (#1613)Henri Vasserman
2023-05-26cuda : performance optimizations (#1530)Johannes Gäßler
2023-05-23OpenCL Token Generation Acceleration (#1459)0cc4m
2023-05-21examples : add server example with REST API (#1443)Steward Garcia
2023-05-20feature : support blis and other blas implementation (#1536)Zenix
2023-05-20Revert "feature : add blis and other BLAS implementation support (#1502)"Georgi Gerganov
2023-05-20feature : add blis and other BLAS implementation support (#1502)Zenix
2023-05-03fix build-info.h for git submodules (#1289)kuvaus
2023-05-02ggml : fix ppc64le build error and make cmake detect Power processors (#1284)Marvin Gießing
2023-05-01Add git-based build information for better issue tracking (#1232)DannyDaemonic
2023-04-30build: add armv{6,7,8} support to cmake (#1251)Pavol Rusnak
2023-04-29build : fix reference to old llama_util.hGeorgi Gerganov
2023-04-28ggml : add CLBlast support (#1164)0cc4m
2023-04-22ggml : fix Q4_3 cuBLASGeorgi Gerganov
2023-04-22cmake : fix build under Windows when enable BUILD_SHARED_LIBS (#1100)Howard Su
2023-04-21cmake : link threads publicly to ggml (#1042)源文雨
2023-04-20Improve cuBLAS performance by dequantizing on the GPU (#1065)slaren
2023-04-19ggml : Q4 cleanup - remove 4-bit dot product code (#1061)Stephan Walter
2023-04-19Add NVIDIA cuBLAS support (#1044)slaren
2023-04-18Adding a simple program to measure speed of dot products (#1041)Kawrakow
2023-04-17Speedup the AVX-512 implementation of ggml_vec_dot_q4_0() (#933)Ivan Komarov
2023-04-15cmake : add finding the OpenBLAS header file (#992)katsu560
2023-04-13llama : merge llama_internal.h into llama.hGeorgi Gerganov