aboutsummaryrefslogtreecommitdiff
path: root/Makefile
AgeCommit message (Expand)Author
2023-07-21make : add new target for test binaries (#2244)Jiří Podivín
2023-07-21make : fix embdinput library and server examples building on MSYS2 (#2235)Przemysław Pawełczyk
2023-07-14make : use pkg-config for OpenBLAS (#2222)wzy
2023-07-14make : fix combination of LLAMA_METAL and LLAMA_MPI (#2208)James Reynolds
2023-07-10mpi : add support for distributed inference via MPI (#2099)Evan Miller
2023-07-07docker : add support for CUDA in docker (#1461)dylan
2023-07-05Quantized dot products for CUDA mul mat vec (#2067)Johannes Gäßler
2023-07-04Allow old Make to build server. (#2098)Henri Vasserman
2023-07-04Update Makefile: clean simple (#2097)ZhouYuChen
2023-06-28llama : support input embeddings directly (#1910)ningshanwutuobang
2023-06-26k-quants : support for super-block size of 64 (#2001)Kawrakow
2023-06-19Convert vector to f16 for dequantize mul mat vec (#1913)Johannes Gäßler
2023-06-18metal : handle buffers larger than device's maxBufferLength (#1826)Georgi Gerganov
2023-06-17make : do not print help for simple exampleGeorgi Gerganov
2023-06-17make : update for latest Arch (#1701)DaniAndTheWeb
2023-06-17Server Example Refactor and Improvements (#1570)Randall Fitzgerald
2023-06-16examples : add "simple" (#1840)SuperUserNameMan
2023-06-16CUDA : faster k-quant dot kernels (#1862)Kawrakow
2023-06-15make : add train-text-from-scratch (#1850)daboe01
2023-06-15make : clean *.so files (#1857)sandyiscool
2023-06-13Allow "quantizing" to f16 and f32 (#1787)Kerfuffle
2023-06-10make : add SSSE3 compilation use case (#1659)rankaiyx
2023-06-07k-quants : allow to optionally disable at compile time (#1734)Georgi Gerganov
2023-06-06ggml : fix builds, add ggml-quants-k.o (close #1712, close #1710)Georgi Gerganov
2023-06-05ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)Kawrakow
2023-06-04llama : Metal inference (#1642)Georgi Gerganov
2023-05-28LLAMA_DEBUG adds debug symbols (#1617)Johannes Gäßler
2023-05-27Include server in releases + other build system cleanups (#1610)Kerfuffle
2023-05-26cuda : performance optimizations (#1530)Johannes Gäßler
2023-05-23OpenCL Token Generation Acceleration (#1459)0cc4m
2023-05-21make : .PHONY clean (#1553)Stefan Sydow
2023-05-20feature : support blis and other blas implementation (#1536)Zenix
2023-05-20Revert "feature : add blis and other BLAS implementation support (#1502)"Georgi Gerganov
2023-05-20feature : add blis and other BLAS implementation support (#1502)Zenix
2023-05-16Add alternate include path for openblas (#1476)sandyiscool
2023-05-13make : fix PERF build with cuBLASGeorgi Gerganov
2023-05-05makefile: automatic Arch Linux detection (#1332)DaniAndTheWeb
2023-05-05Fix for OpenCL / clbast builds on macOS. (#1329)Ionoclast Laboratories
2023-05-02Call sh on build-info.sh (#1294)DannyDaemonic
2023-05-01Add git-based build information for better issue tracking (#1232)DannyDaemonic
2023-04-30build: add armv{6,7,8} support to cmake (#1251)Pavol Rusnak
2023-04-30Various fixes to mat_mul benchmark (#1253)Stephan Walter
2023-04-29ggml : adjust mul_mat_f16 work memory (#1226)Georgi Gerganov
2023-04-29build : fix reference to old llama_util.hGeorgi Gerganov
2023-04-29cuBLAS: use host pinned memory and dequantize while copying (#1207)slaren
2023-04-28ggml : add CLBlast support (#1164)0cc4m
2023-04-28Add Manjaro CUDA include and lib dirs to Makefile (#1212)Johannes Gäßler
2023-04-24Fix cuda compilation (#1128)slaren
2023-04-23ggml : better PERF prints + support "LLAMA_PERF=1 make"Georgi Gerganov
2023-04-22ggml : fix AVX build + update to new Q8_0 formatGeorgi Gerganov