llama.cpp.git - llama.cpp

Age	Commit message (Expand)	Author
2023-03-30	ggml : fix NEON signs (close #620, #622)	Georgi Gerganov
2023-03-30	Fix GGML_F32Cx8_STORE in AVX without F16C path (#619)	slaren
2023-03-29	ci : re-enable AVX512 testing (Windows-MSVC) (#584)	anzz1
2023-03-29	ggml : init time on first ggml_init() call	Georgi Gerganov
2023-03-29	llama : fix compile warnings when reading the vocab	Georgi Gerganov
2023-03-29	ggml : add ARM_NEON dequantize_row_q4_1()	Georgi Gerganov
2023-03-29	ggml : add ARM_NEON quantize_row_q4_1()	Georgi Gerganov
2023-03-29	ggml : add ARM_NEON ggml_vec_dot_q4_1()	Georgi Gerganov
2023-03-29	rename convert_ggml_to_pth.py -> convert-ggml-to-pth.py (#600)	Pavol Rusnak
2023-03-29	Create chat-13B.bat (#592)	Thérence
2023-03-29	readme : fix typos	Georgi Gerganov
2023-03-29	readme : add GPT4All instructions (close #588)	Georgi Gerganov
2023-03-29	py : add GPT4All conversion script	Georgi Gerganov
2023-03-29	llama : use the same threshold for OpenBLAS and ggml thread limiting (#577)	Maël Kerbiriou
2023-03-29	add example of re-act pattern (#583)	Tobias Lütke
2023-03-29	Fix GCC warning about binary literal (#595)	anzz1
2023-03-29	Fix typo in llama.h (#593)	anzz1
2023-03-28	Enable Fused-Multiply-Add (FMA) and F16C/CVT16 vector extensions on MSVC (#375)	anzz1
2023-03-28	CI: fix subdirectory path globbing (#546)	anzz1
2023-03-28	llama : fix linkage with mingw (#551)	anzz1
2023-03-28	ggml : add AVX2 implementation of quantize_row_q4_1 (#515)	slaren
2023-03-28	py : add temporary script to convert old ggml files to newer version (#539)	thement
2023-03-28	py : add capabiliy to convert from ggml back to torch or hf format for furthe...	Tai Duc Nguyen
2023-03-28	ggml : refactor quantized processing functions (#509)	Stephan Walter
2023-03-28	py : removed unused `model` variable and verified that the code functions cor...	DooWoong Lee (David)
2023-03-28	ci : make ctest verbose, hopefully we see what is wrong with the sanitizer	Georgi Gerganov
2023-03-28	tests : free llama context at the end of the test	Georgi Gerganov
2023-03-28	all : be more strict about converting float to double (#458)	Stephan Walter
2023-03-28	deploy : add a Package.swift for SwiftPM support (#393)	Jed Fox
2023-03-28	ggml : introduce structs for the q4 data blocks (#356)	Stephan Walter
2023-03-28	gitignore : add "embedding"	Georgi Gerganov
2023-03-28	Check the existence of f16_model_path_base in quantize.py (#574)	dotpy314
2023-03-28	Fix usage of F16C intrinsics in AVX code (#563)	slaren
2023-03-28	main.cpp fixes, refactoring (#571)	anzz1
2023-03-28	Add embedding example to Makefile (#540)	RJ Adriaansen
2023-03-27	Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542)	Marco Matthies
2023-03-26	ci: add debug build to sanitizer build matrix (#527)	Erik Scholz
2023-03-26	Fix undefined variables in debug build, remove unused variables (#531)	Stephan Walter
2023-03-26	Add support for linux/arm64 platform during Docker Builds (#514)	Juan Calderon-Perez
2023-03-26	Update README and comments for standalone perplexity tool (#525)	Stephan Walter
2023-03-26	[main] fix infinite generation (-n == -1) (#523)	anzz1
2023-03-26	Add logo to README.md	Georgi Gerganov
2023-03-26	Exit from interactive mode if input stream is bad (#491)	Harald Fernengel
2023-03-26	CI: Run other sanitizer builds even if one fails (#511)	anzz1
2023-03-25	Clarify console output in convert-pth-to-ggml.py (#512)	jp-x-g
2023-03-25	CMake / CI additions (#497)	anzz1
2023-03-25	(Windows) Set console to UTF-8 on init (#420)	anzz1
2023-03-25	Fix colors enabling on WIN32	Georgi Gerganov
2023-03-25	If n_predict == -1, generate forever	Georgi Gerganov
2023-03-25	Inifinite generation via context swapping (#71)	Georgi Gerganov