llama.cpp.git - llama.cpp

Age	Commit message (Collapse)	Author
2023-03-13	Revert "10% performance boost on ARM"	Georgi Gerganov
	This reverts commit 113a9e83ebc0f788f861394437087bf3ca0e019b. There are some reports for illegal instruction. Moved this stuff to vdotq_s32 branch until resolve
2023-03-13	Check for vdotq_s32 availability	Georgi Gerganov

2023-03-13	Ammend to previous commit - forgot to update non-QRDMX branch	Georgi Gerganov

2023-03-13	10% performance boost on ARM	Georgi Gerganov

2023-03-12	Windows fixes (#31)	Sebastián A
	* Apply fixes suggested to build on windows Issue: https://github.com/ggerganov/llama.cpp/issues/22 * Remove unsupported VLAs * MSVC: Remove features that are only available on MSVC C++20. * Fix zero initialization of the other fields. * Change the use of vector for stack allocations.
2023-03-11	Add AVX2 support for x86 architectures thanks to @Const-me !	Georgi Gerganov

2023-03-11	Support all LLaMA models + change Q4_0 quantization storage	Georgi Gerganov

2023-03-10	Initial release	Georgi Gerganov