index
:
llama.cpp.git
master
llama.cpp
user
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
.gitignore
Age
Commit message (
Expand
)
Author
2023-06-04
llama : Metal inference (#1642)
Georgi Gerganov
2023-05-13
llama : fix various warnings
Georgi Gerganov
2023-05-12
ggml : remove bit shuffling (#1405)
Georgi Gerganov
2023-05-08
llama : require first token to be BOS (#1303)
Georgi Gerganov
2023-05-06
Remove default arguments from sampling functions (#1343)
Jed Fox
2023-05-01
Add git-based build information for better issue tracking (#1232)
DannyDaemonic
2023-04-30
Various fixes to mat_mul benchmark (#1253)
Stephan Walter
2023-04-28
examples : add Jeopardy example (#1168)
CRD716
2023-04-26
ggml : add Q5_0 and Q5_1 quantization (#1187)
Georgi Gerganov
2023-04-19
ggml : use 8-bit precision for Q4_1 intermediate results (#1047)
Georgi Gerganov
2023-04-18
gitignore : vdot
Georgi Gerganov
2023-04-13
gitignore : benchmark
Georgi Gerganov
2023-04-08
Add quantize-stats command for testing quantization (#728)
unbounded
2023-04-05
zig : add build.zig (#773)
iacore
2023-03-30
Make loading weights 10-100x faster
Justine Tunney
2023-03-28
deploy : add a Package.swift for SwiftPM support (#393)
Jed Fox
2023-03-28
gitignore : add "embedding"
Georgi Gerganov
2023-03-25
Overhaul the examples structure
Georgi Gerganov
2023-03-17
Nix flake (#40)
Niklas Korz
2023-03-11
.gitignore models/
Georgi Gerganov
2023-03-10
Initial release
Georgi Gerganov