index
:
llama.cpp.git
master
llama.cpp
user
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
llama.cpp
Age
Commit message (
Expand
)
Author
2023-03-25
Don't interefe with BLAS for large prompts by running only 1 thread
Georgi Gerganov
2023-03-25
Add timings for the prompt evaluation (#478)
slaren
2023-03-25
Fix nasty bug in ggml_compute_forward_mul_mat_f32() and reenable BLAS
Georgi Gerganov
2023-03-25
Add support for file load progress reporting callbacks (#434)
Jed Fox
2023-03-25
Fix crash for 65B model with pre-allocated memory (#485)
Chris Kuehl
2023-03-24
Reduce memory usage and allocate enough memory for largest context (#473)
Georgi Gerganov
2023-03-24
Temporary bump the memory buffer size - hopefully fix issues from 483bab2e
Georgi Gerganov
2023-03-24
Properly free llama_context on failure
Georgi Gerganov
2023-03-24
Support calling mlock() on loaded model data on Linux and macOS (#453)
comex
2023-03-24
Add embedding mode with arg flag. Currently working (#282)
Luciano
2023-03-24
Revert "Fix memory allocation issues and seg faults"
Georgi Gerganov
2023-03-24
Fix memory allocation issues and seg faults
Georgi Gerganov
2023-03-23
Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439)
Georgi Gerganov
2023-03-22
Add missing header for memcpy (#386)
Yusuf Kağan Hanoğlu
2023-03-22
Init llama_context_params properly from CLI (#370)
Georgi Gerganov
2023-03-22
Introduce C-style API (#370)
Georgi Gerganov