index
:
llama.cpp.git
master
llama.cpp
user
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2023-03-29
py : add GPT4All conversion script
Georgi Gerganov
2023-03-29
llama : use the same threshold for OpenBLAS and ggml thread limiting (#577)
Maël Kerbiriou
2023-03-29
add example of re-act pattern (#583)
Tobias Lütke
2023-03-29
Fix GCC warning about binary literal (#595)
anzz1
2023-03-29
Fix typo in llama.h (#593)
anzz1
2023-03-28
Enable Fused-Multiply-Add (FMA) and F16C/CVT16 vector extensions on MSVC (#375)
anzz1
2023-03-28
CI: fix subdirectory path globbing (#546)
anzz1
2023-03-28
llama : fix linkage with mingw (#551)
anzz1
2023-03-28
ggml : add AVX2 implementation of quantize_row_q4_1 (#515)
slaren
2023-03-28
py : add temporary script to convert old ggml files to newer version (#539)
thement
2023-03-28
py : add capabiliy to convert from ggml back to torch or hf format for furthe...
Tai Duc Nguyen
2023-03-28
ggml : refactor quantized processing functions (#509)
Stephan Walter
2023-03-28
py : removed unused `model` variable and verified that the code functions cor...
DooWoong Lee (David)
2023-03-28
ci : make ctest verbose, hopefully we see what is wrong with the sanitizer
Georgi Gerganov
2023-03-28
tests : free llama context at the end of the test
Georgi Gerganov
2023-03-28
all : be more strict about converting float to double (#458)
Stephan Walter
2023-03-28
deploy : add a Package.swift for SwiftPM support (#393)
Jed Fox
2023-03-28
ggml : introduce structs for the q4 data blocks (#356)
Stephan Walter
2023-03-28
gitignore : add "embedding"
Georgi Gerganov
2023-03-28
Check the existence of f16_model_path_base in quantize.py (#574)
dotpy314
2023-03-28
Fix usage of F16C intrinsics in AVX code (#563)
slaren
2023-03-28
main.cpp fixes, refactoring (#571)
anzz1
2023-03-28
Add embedding example to Makefile (#540)
RJ Adriaansen
2023-03-27
Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542)
Marco Matthies
2023-03-26
ci: add debug build to sanitizer build matrix (#527)
Erik Scholz
2023-03-26
Fix undefined variables in debug build, remove unused variables (#531)
Stephan Walter
2023-03-26
Add support for linux/arm64 platform during Docker Builds (#514)
Juan Calderon-Perez
2023-03-26
Update README and comments for standalone perplexity tool (#525)
Stephan Walter
2023-03-26
[main] fix infinite generation (-n == -1) (#523)
anzz1
2023-03-26
Add logo to README.md
Georgi Gerganov
2023-03-26
Exit from interactive mode if input stream is bad (#491)
Harald Fernengel
2023-03-26
CI: Run other sanitizer builds even if one fails (#511)
anzz1
2023-03-25
Clarify console output in convert-pth-to-ggml.py (#512)
jp-x-g
2023-03-25
CMake / CI additions (#497)
anzz1
2023-03-25
(Windows) Set console to UTF-8 on init (#420)
anzz1
2023-03-25
Fix colors enabling on WIN32
Georgi Gerganov
2023-03-25
If n_predict == -1, generate forever
Georgi Gerganov
2023-03-25
Inifinite generation via context swapping (#71)
Georgi Gerganov
2023-03-25
Cleanup STL headers + fix embedding examples + minor stuff
Georgi Gerganov
2023-03-25
Move chat scripts into "./examples"
Georgi Gerganov
2023-03-25
Add AVX2 implementation of dequantize_row_q4_1 (#505)
slaren
2023-03-25
Overhaul the examples structure
Georgi Gerganov
2023-03-25
Retire the ggml_mul_mat() branch for transposed src0 (#500)
Georgi Gerganov
2023-03-25
Disable prompt verbosity by default and add option to enable (#480)
Georgi Gerganov
2023-03-25
Add AVX2 implementation of dequantize_row_q4_0 (#467)
slaren
2023-03-25
Don't interefe with BLAS for large prompts by running only 1 thread
Georgi Gerganov
2023-03-25
Add longer DAN prompt for testing big batch numbers
Georgi Gerganov
2023-03-25
Add timings for the prompt evaluation (#478)
slaren
2023-03-25
Remove obsolete information from README
Georgi Gerganov
2023-03-25
Remove obsolete assert and fix compiler warning
Georgi Gerganov
[prev]
[next]