index
:
llama.cpp.git
master
llama.cpp
user
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2023-03-13
Use vdotq_s32 to improve performance (#67)
Georgi Gerganov
2023-03-13
Reduce model loading time (#43)
uint256_t
2023-03-13
Fix UTF-8 handling (including colors) (#79)
Val Kharitonov
2023-03-13
Add quantize script for batch quantization (#92)
Pavol Rusnak
2023-03-13
Add initial contribution guidelines
Georgi Gerganov
2023-03-13
Gate signal support on being on a unixoid system. (#74)
Matvey Soloviev
2023-03-13
Fix token count accounting
Matvey Soloviev
2023-03-13
Revert "10% performance boost on ARM"
Georgi Gerganov
2023-03-13
Check for vdotq_s32 availability
Georgi Gerganov
2023-03-13
Ammend to previous commit - forgot to update non-QRDMX branch
Georgi Gerganov
2023-03-13
10% performance boost on ARM
Georgi Gerganov
2023-03-13
Fix color getting reset before prompt output done (#65)
Matvey Soloviev
2023-03-12
Update README.md
Georgi Gerganov
2023-03-12
Add interactive mode (#61)
Matvey Soloviev
2023-03-12
Fix typo in README (#45)
Marc Köhlbrugge
2023-03-12
Allow using prompt files (#59)
Ben Garney
2023-03-12
Add back top_k (#56)
beiller
2023-03-12
Windows fixes (#31)
Sebastián A
2023-03-12
Update README.md
Georgi Gerganov
2023-03-12
Add CI (#60)
Georgi Gerganov
2023-03-12
Revert "weights_only" arg - this causing more trouble than help
Georgi Gerganov
2023-03-12
python/pytorch compat notes (#44)
Oleksandr Nikitin
2023-03-12
Add repetition penalty (#20)
beiller
2023-03-12
Clarify meaning of hacking
Georgi Gerganov
2023-03-12
README: add "Supported platforms" + update hot topics
Georgi Gerganov
2023-03-12
use weights_only in conversion script (#32)
deepdiffuser
2023-03-12
Add LICENSE (#21)
Pavol Rusnak
2023-03-12
Update README.md
Georgi Gerganov
2023-03-11
Fix a typo in model name (#16)
Juraj Bednar
2023-03-11
Update README.md
Georgi Gerganov
2023-03-11
Add AVX2 support for x86 architectures thanks to @Const-me !
Georgi Gerganov
2023-03-11
Fix un-initialized FP16 tables on x86 (#15, #2)
Georgi Gerganov
2023-03-11
Bump memory buffer
Georgi Gerganov
2023-03-11
Update README.md
Georgi Gerganov
2023-03-11
.gitignore models/
Georgi Gerganov
2023-03-11
Update Makefile var + add comment
Georgi Gerganov
2023-03-11
Update README.md
Georgi Gerganov
2023-03-11
Update README.md
Georgi Gerganov
2023-03-11
Support all LLaMA models + change Q4_0 quantization storage
Georgi Gerganov
2023-03-11
Include Python dependencies in README (#6)
Simon Willison
2023-03-11
Update README.md
Georgi Gerganov
2023-03-11
Update README.md
Georgi Gerganov
2023-03-11
Update README.md
Georgi Gerganov
2023-03-11
Add missing headers for memcpy and assert (#3)
Jean-Michaël Celerier
2023-03-11
Update README.md
Georgi Gerganov
2023-03-11
Update README.md
Georgi Gerganov
2023-03-11
Update README.md
Georgi Gerganov
2023-03-10
Update README.md
Georgi Gerganov
2023-03-10
Fix a bug in the rope calculation
Georgi Gerganov
2023-03-10
Update README.md
Georgi Gerganov
[prev]
[next]