index
:
llama.cpp.git
master
llama.cpp
user
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
README.md
Age
Commit message (
Expand
)
Author
2023-07-04
readme : add link web chat PR
Georgi Gerganov
2023-07-01
convert : add support of baichuan-7b (#2055)
Judd
2023-06-26
readme : add Scala 3 bindings repo (#2010)
Roman Parykin
2023-06-26
readme : LD_LIBRARY_PATH complement for some Android devices when building wi...
Gustavo Rocha Dias
2023-06-26
readme : add link to new k-quants for visibility
Georgi Gerganov
2023-06-25
readme : add new roadmap + manifesto
Georgi Gerganov
2023-06-25
readme : add Azure CI discussion link
Georgi Gerganov
2023-06-24
readme : fix whitespaces
Georgi Gerganov
2023-06-24
readme : fixed termux instructions (#1973)
Alberto
2023-06-23
Add OpenLLaMA instructions to the README (#1954)
eiery
2023-06-21
Fix typo in README.md (#1961)
Rahul Vivek Nair
2023-06-20
readme : add link to p1
Georgi Gerganov
2023-06-20
Fix typo (#1949)
Xiake Sun
2023-06-19
Convert vector to f16 for dequantize mul mat vec (#1913)
Johannes Gäßler
2023-06-18
readme : update Android build instructions (#1922)
Mike
2023-06-17
Only one CUDA stream per device for async compute (#1898)
Johannes Gäßler
2023-06-17
readme : alternative way to build for Android with CLBlast. (#1828)
Gustavo Rocha Dias
2023-06-10
doc : fix wrong address of BLIS.md (#1772)
Aisuko
2023-06-07
readme : add June roadmap
Georgi Gerganov
2023-06-05
docs : add performance troubleshoot + example benchmark documentation (#1674)
Yuval Peled
2023-06-05
readme : fix typo (#1700)
Foul-Tarnished
2023-06-04
readme : update hot topics
Georgi Gerganov
2023-06-04
llama : Metal inference (#1642)
Georgi Gerganov
2023-06-03
Add info about CUDA_VISIBLE_DEVICES (#1682)
Henri Vasserman
2023-05-27
Add documentation about CLBlast (#1604)
Henri Vasserman
2023-05-24
readme : add docs for chat-persistent.sh (#1568)
Evan Jones
2023-05-20
feature : support blis and other blas implementation (#1536)
Zenix
2023-05-20
Revert "feature : add blis and other BLAS implementation support (#1502)"
Georgi Gerganov
2023-05-20
feature : add blis and other BLAS implementation support (#1502)
Zenix
2023-05-19
ggml : use F16 instead of F32 in Q4_0, Q4_1, Q8_0 (#1508)
Georgi Gerganov
2023-05-19
readme : adds WizardLM to the list of supported models (#1485)
David Kennedy
2023-05-13
readme : update Q4_0 perplexities
Georgi Gerganov
2023-05-12
readme : add C#/.NET bindings repo (#1409)
Rinne
2023-05-12
ggml : remove bit shuffling (#1405)
Georgi Gerganov
2023-05-08
readme : add notice about upcoming breaking change
Georgi Gerganov
2023-05-08
readme : add TOC and Pygmalion instructions (#1359)
AlpinDale
2023-05-08
llama : require first token to be BOS (#1303)
Georgi Gerganov
2023-05-08
Documented CUDA reproducibility, added warning (#1346)
Johannes Gäßler
2023-05-05
makefile: automatic Arch Linux detection (#1332)
DaniAndTheWeb
2023-05-05
readme: add missing info (#1324)
Pavol Rusnak
2023-05-04
readme : add OpenBuddy link (#1321)
44670
2023-05-03
minor : fix whitespaces (#1302)
Georgi Gerganov
2023-05-03
scripts : platform independent script to verify sha256 checksums (#1203)
KASR
2023-04-28
Remove Q4_3 which is no better than Q5 (#1218)
Stephan Walter
2023-04-28
readme : update hot topics
Georgi Gerganov
2023-04-28
Correcting link to w64devkit (#1214)
Folko-Ven
2023-04-26
readme : add quantization info
Georgi Gerganov
2023-04-26
Updating build instructions to include BLAS support (#1183)
DaniAndTheWeb
2023-04-26
quantize : use `map` to assign quantization type from `string` (#1191)
Pavol Rusnak
2023-04-24
examples/main README improvements and some light refactoring (#1131)
mgroeber9110
[next]