index : llama.cpp.git (master)
path: root/README.md
Age | Commit message | Author
2023-08-02 | readme : add Aquila-7B model series to supported models (#2487) | ldwang
2023-08-02 | readme : Add Chinese LLaMA-2 / Alpaca-2 to supported models (#2475) | Yiming Cui
2023-07-31 | CUDA: mmq CLI option, fixed mmq build issues (#2453) | Johannes Gäßler
2023-07-29 | CUDA: Quantized matrix matrix multiplication (#2160) | Johannes Gäßler
2023-07-28 | Obtaining LLaMA 2 instructions (#2308) | niansa/tuxifan
2023-07-23 | Fix __dp4a documentation (#2348) | Johannes Gäßler
2023-07-23 | make : fix CLBLAST compile support in FreeBSD (#2331) | Jose Maldonado
2023-07-21 | flake : remove intel mkl from flake.nix due to missing files (#2277) | wzy
2023-07-19 | flake : update flake.nix (#2270) | wzy
2023-07-16 | py : turn verify-checksum-models.py into executable (#2245) | Jiří Podivín
2023-07-11 | readme : fix zig build instructions (#2171) | Chad Brewbaker
2023-07-10 | mpi : add support for distributed inference via MPI (#2099) | Evan Miller
2023-07-09 | readme : update Termux instructions (#2147) | JackJollimore
2023-07-09 | readme : add more docs indexes (#2127) | rankaiyx
2023-07-07 | docker : add support for CUDA in docker (#1461) | dylan
2023-07-06 | convert : update for baichuan (#2081) | Judd
2023-07-05 | Quantized dot products for CUDA mul mat vec (#2067) | Johannes Gäßler
2023-07-04 | readme : add link web chat PR | Georgi Gerganov
2023-07-01 | convert : add support of baichuan-7b (#2055) | Judd
2023-06-26 | readme : add Scala 3 bindings repo (#2010) | Roman Parykin
2023-06-26 | readme : LD_LIBRARY_PATH complement for some Android devices when building wi... | Gustavo Rocha Dias
2023-06-26 | readme : add link to new k-quants for visibility | Georgi Gerganov
2023-06-25 | readme : add new roadmap + manifesto | Georgi Gerganov
2023-06-25 | readme : add Azure CI discussion link | Georgi Gerganov
2023-06-24 | readme : fix whitespaces | Georgi Gerganov
2023-06-24 | readme : fixed termux instructions (#1973) | Alberto
2023-06-23 | Add OpenLLaMA instructions to the README (#1954) | eiery
2023-06-21 | Fix typo in README.md (#1961) | Rahul Vivek Nair
2023-06-20 | readme : add link to p1 | Georgi Gerganov
2023-06-20 | Fix typo (#1949) | Xiake Sun
2023-06-19 | Convert vector to f16 for dequantize mul mat vec (#1913) | Johannes Gäßler
2023-06-18 | readme : update Android build instructions (#1922) | Mike
2023-06-17 | Only one CUDA stream per device for async compute (#1898) | Johannes Gäßler
2023-06-17 | readme : alternative way to build for Android with CLBlast. (#1828) | Gustavo Rocha Dias
2023-06-10 | doc : fix wrong address of BLIS.md (#1772) | Aisuko
2023-06-07 | readme : add June roadmap | Georgi Gerganov
2023-06-05 | docs : add performance troubleshoot + example benchmark documentation (#1674) | Yuval Peled
2023-06-05 | readme : fix typo (#1700) | Foul-Tarnished
2023-06-04 | readme : update hot topics | Georgi Gerganov
2023-06-04 | llama : Metal inference (#1642) | Georgi Gerganov
2023-06-03 | Add info about CUDA_VISIBLE_DEVICES (#1682) | Henri Vasserman
2023-05-27 | Add documentation about CLBlast (#1604) | Henri Vasserman
2023-05-24 | readme : add docs for chat-persistent.sh (#1568) | Evan Jones
2023-05-20 | feature : support blis and other blas implementation (#1536) | Zenix
2023-05-20 | Revert "feature : add blis and other BLAS implementation support (#1502)" | Georgi Gerganov
2023-05-20 | feature : add blis and other BLAS implementation support (#1502) | Zenix
2023-05-19 | ggml : use F16 instead of F32 in Q4_0, Q4_1, Q8_0 (#1508) | Georgi Gerganov
2023-05-19 | readme : adds WizardLM to the list of supported models (#1485) | David Kennedy
2023-05-13 | readme : update Q4_0 perplexities | Georgi Gerganov
2023-05-12 | readme : add C#/.NET bindings repo (#1409) | Rinne