llama.cpp.git - llama.cpp

Age	Commit message	Author
2023-03-17	Q4_1 quantization (#193)	Matvey Soloviev
	* Add AVX2 version of ggml_vec_dot_q4_1
	* Small optimisations to q4_1 dot product (@Const-me)
	* Rearrange Q4_1 quantization to work for multipart models. (Fix #152)
	* Fix ggml_vec_mad_q4_1 too
	* Fix non-vectorised q4_1 vec mul
2023-03-15	added ctx_size parameter (#148)	Justin Suess
	* added ctx_size parameter
	* added it in more places
	* Apply suggestions from code review
	---------
	Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-13	Add NetBSD support. (#90)	Thomas Klausner

2023-03-12	Add interactive mode (#61)	Matvey Soloviev
	* Initial work on interactive mode.
	* Improve interactive mode. Make rev. prompt optional.
	* Update README to explain interactive mode.
	* Fix OS X build
2023-03-12	Allow using prompt files (#59)	Ben Garney

2023-03-12	Add back top_k (#56)	beiller
	* Add back top_k
	* Update utils.cpp
	* Update utils.h
	---------
	Co-authored-by: Bill Hamilton <bill.hamilton@shopify.com>
	Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-12	Windows fixes (#31)	Sebastián A
	* Apply fixes suggested to build on windows
	  Issue: https://github.com/ggerganov/llama.cpp/issues/22
	* Remove unsupported VLAs
	* MSVC: Remove features that are only available on MSVC C++20.
	* Fix zero initialization of the other fields.
	* Change the use of vector for stack allocations.
2023-03-12	Add repetition penalty (#20)	beiller
	* Adding repeat penalization
	* Update utils.h
	* Update utils.cpp
	* Numeric fix
	  Should probably still scale by temp even if penalized
	* Update comments, more proper application
	  I see that numbers can go negative so a fix from a referenced commit
	* Minor formatting
	---------
	Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-11	Support all LLaMA models + change Q4_0 quantization storage	Georgi Gerganov

2023-03-11	Add missing headers for memcpy and assert (#3)	Jean-Michaël Celerier

2023-03-10	Fix a bug in the rope calculation	Georgi Gerganov

2023-03-10	Final touches	Georgi Gerganov

2023-03-10	Initial release	Georgi Gerganov