index : llama.cpp.git (branch: master)
Age
Commit message
Author
2023-03-28
main.cpp fixes, refactoring (#571)
anzz1
2023-03-28
Add embedding example to Makefile (#540)
RJ Adriaansen
2023-03-27
Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542)
Marco Matthies
2023-03-26
ci: add debug build to sanitizer build matrix (#527)
Erik Scholz
2023-03-26
Fix undefined variables in debug build, remove unused variables (#531)
Stephan Walter
2023-03-26
Add support for linux/arm64 platform during Docker Builds (#514)
Juan Calderon-Perez
2023-03-26
Update README and comments for standalone perplexity tool (#525)
Stephan Walter
2023-03-26
[main] fix infinite generation (-n == -1) (#523)
anzz1
2023-03-26
Add logo to README.md
Georgi Gerganov
2023-03-26
Exit from interactive mode if input stream is bad (#491)
Harald Fernengel
2023-03-26
CI: Run other sanitizer builds even if one fails (#511)
anzz1
2023-03-25
Clarify console output in convert-pth-to-ggml.py (#512)
jp-x-g
2023-03-25
CMake / CI additions (#497)
anzz1
2023-03-25
(Windows) Set console to UTF-8 on init (#420)
anzz1
2023-03-25
Fix colors enabling on WIN32
Georgi Gerganov
2023-03-25
If n_predict == -1, generate forever
Georgi Gerganov
2023-03-25
Infinite generation via context swapping (#71)
Georgi Gerganov
2023-03-25
Cleanup STL headers + fix embedding examples + minor stuff
Georgi Gerganov
2023-03-25
Move chat scripts into "./examples"
Georgi Gerganov
2023-03-25
Add AVX2 implementation of dequantize_row_q4_1 (#505)
slaren
2023-03-25
Overhaul the examples structure
Georgi Gerganov
2023-03-25
Retire the ggml_mul_mat() branch for transposed src0 (#500)
Georgi Gerganov
2023-03-25
Disable prompt verbosity by default and add option to enable (#480)
Georgi Gerganov
2023-03-25
Add AVX2 implementation of dequantize_row_q4_0 (#467)
slaren
2023-03-25
Don't interfere with BLAS for large prompts by running only 1 thread
Georgi Gerganov
2023-03-25
Add longer DAN prompt for testing big batch numbers
Georgi Gerganov
2023-03-25
Add timings for the prompt evaluation (#478)
slaren
2023-03-25
Remove obsolete information from README
Georgi Gerganov
2023-03-25
Remove obsolete assert and fix compiler warning
Georgi Gerganov
2023-03-25
Fix nasty bug in ggml_compute_forward_mul_mat_f32() and reenable BLAS
Georgi Gerganov
2023-03-25
bounds checking for input prefix (#492)
anzz1
2023-03-25
feat: '--in-prefix STRING' option (#426)
anzz1
2023-03-25
Add support for file load progress reporting callbacks (#434)
Jed Fox
2023-03-25
Add missing struct annotation (#483)
Doomsdayrs
2023-03-25
Fix crash for 65B model with pre-allocated memory (#485)
Chris Kuehl
2023-03-24
Disable BLAS altogether - the bug is not just for quantized mat mul
Georgi Gerganov
2023-03-24
Disable BLAS branch in mul_mat - seems there is a bug
Georgi Gerganov
2023-03-24
Immediately start processing the prompt before user input has been provided (...
Georgi Gerganov
2023-03-24
Reduce memory usage and allocate enough memory for largest context (#473)
Georgi Gerganov
2023-03-24
Temporarily bump the memory buffer size - hopefully fix issues from 483bab2e
Georgi Gerganov
2023-03-24
Update README.md (#444)
Gary Mulder
2023-03-24
fix instruct mode (#445)
rabidcopy
2023-03-24
Properly free llama_context on failure
Georgi Gerganov
2023-03-24
additional optimizations for POWER9 (#454)
Cameron Kaiser
2023-03-24
Support calling mlock() on loaded model data on Linux and macOS (#453)
comex
2023-03-24
Add embedding mode with arg flag. Currently working (#282)
Luciano
2023-03-24
Add link to Roadmap discussion
Georgi Gerganov
2023-03-24
Revert "Fix memory allocation issues and seg faults"
Georgi Gerganov
2023-03-24
Fix memory allocation issues and seg faults
Georgi Gerganov
2023-03-23
Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439)
Georgi Gerganov