path: root/README.md
Age  Commit message  Author
2023-07-29  CUDA: Quantized matrix matrix multiplication (#2160)  Johannes Gäßler
2023-07-28  Obtaining LLaMA 2 instructions (#2308)  niansa/tuxifan
2023-07-23  Fix __dp4a documentation (#2348)  Johannes Gäßler
2023-07-23  make : fix CLBLAST compile support in FreeBSD (#2331)  Jose Maldonado
2023-07-21  flake : remove intel mkl from flake.nix due to missing files (#2277)  wzy
2023-07-19  flake : update flake.nix (#2270)  wzy
2023-07-16  py : turn verify-checksum-models.py into executable (#2245)  Jiří Podivín
2023-07-11  readme : fix zig build instructions (#2171)  Chad Brewbaker
2023-07-10  mpi : add support for distributed inference via MPI (#2099)  Evan Miller
2023-07-09  readme : update Termux instructions (#2147)  JackJollimore
2023-07-09  readme : add more docs indexes (#2127)  rankaiyx
2023-07-07  docker : add support for CUDA in docker (#1461)  dylan
2023-07-06  convert : update for baichuan (#2081)  Judd
2023-07-05  Quantized dot products for CUDA mul mat vec (#2067)  Johannes Gäßler
2023-07-04  readme : add link web chat PR  Georgi Gerganov
2023-07-01  convert : add support of baichuan-7b (#2055)  Judd
2023-06-26  readme : add Scala 3 bindings repo (#2010)  Roman Parykin
2023-06-26  readme : LD_LIBRARY_PATH complement for some Android devices when building wi...  Gustavo Rocha Dias
2023-06-26  readme : add link to new k-quants for visibility  Georgi Gerganov
2023-06-25  readme : add new roadmap + manifesto  Georgi Gerganov
2023-06-25  readme : add Azure CI discussion link  Georgi Gerganov
2023-06-24  readme : fix whitespaces  Georgi Gerganov
2023-06-24  readme : fixed termux instructions (#1973)  Alberto
2023-06-23  Add OpenLLaMA instructions to the README (#1954)  eiery
2023-06-21  Fix typo in README.md (#1961)  Rahul Vivek Nair
2023-06-20  readme : add link to p1  Georgi Gerganov
2023-06-20  Fix typo (#1949)  Xiake Sun
2023-06-19  Convert vector to f16 for dequantize mul mat vec (#1913)  Johannes Gäßler
2023-06-18  readme : update Android build instructions (#1922)  Mike
2023-06-17  Only one CUDA stream per device for async compute (#1898)  Johannes Gäßler
2023-06-17  readme : alternative way to build for Android with CLBlast. (#1828)  Gustavo Rocha Dias
2023-06-10  doc : fix wrong address of BLIS.md (#1772)  Aisuko
2023-06-07  readme : add June roadmap  Georgi Gerganov
2023-06-05  docs : add performance troubleshoot + example benchmark documentation (#1674)  Yuval Peled
2023-06-05  readme : fix typo (#1700)  Foul-Tarnished
2023-06-04  readme : update hot topics  Georgi Gerganov
2023-06-04  llama : Metal inference (#1642)  Georgi Gerganov
2023-06-03  Add info about CUDA_VISIBLE_DEVICES (#1682)  Henri Vasserman
2023-05-27  Add documentation about CLBlast (#1604)  Henri Vasserman
2023-05-24  readme : add docs for chat-persistent.sh (#1568)  Evan Jones
2023-05-20  feature : support blis and other blas implementation (#1536)  Zenix
2023-05-20  Revert "feature : add blis and other BLAS implementation support (#1502)"  Georgi Gerganov
2023-05-20  feature : add blis and other BLAS implementation support (#1502)  Zenix
2023-05-19  ggml : use F16 instead of F32 in Q4_0, Q4_1, Q8_0 (#1508)  Georgi Gerganov
2023-05-19  readme : adds WizardLM to the list of supported models (#1485)  David Kennedy
2023-05-13  readme : update Q4_0 perplexities  Georgi Gerganov
2023-05-12  readme : add C#/.NET bindings repo (#1409)  Rinne
2023-05-12  ggml : remove bit shuffling (#1405)  Georgi Gerganov
2023-05-08  readme : add notice about upcoming breaking change  Georgi Gerganov
2023-05-08  readme : add TOC and Pygmalion instructions (#1359)  AlpinDale