llama.cpp.git - llama.cpp

Age	Commit message (Expand)	Author
2023-03-13	Use vdotq_s32 to improve performance (#67)	Georgi Gerganov
2023-03-13	Reduce model loading time (#43)	uint256_t
2023-03-13	Fix UTF-8 handling (including colors) (#79)	Val Kharitonov
2023-03-13	Add quantize script for batch quantization (#92)	Pavol Rusnak
2023-03-13	Add initial contribution guidelines	Georgi Gerganov
2023-03-13	Gate signal support on being on a unixoid system. (#74)	Matvey Soloviev
2023-03-13	Fix token count accounting	Matvey Soloviev
2023-03-13	Revert "10% performance boost on ARM"	Georgi Gerganov
2023-03-13	Check for vdotq_s32 availability	Georgi Gerganov
2023-03-13	Ammend to previous commit - forgot to update non-QRDMX branch	Georgi Gerganov
2023-03-13	10% performance boost on ARM	Georgi Gerganov
2023-03-13	Fix color getting reset before prompt output done (#65)	Matvey Soloviev
2023-03-12	Update README.md	Georgi Gerganov
2023-03-12	Add interactive mode (#61)	Matvey Soloviev
2023-03-12	Fix typo in README (#45)	Marc Köhlbrugge
2023-03-12	Allow using prompt files (#59)	Ben Garney
2023-03-12	Add back top_k (#56)	beiller
2023-03-12	Windows fixes (#31)	Sebastián A
2023-03-12	Update README.md	Georgi Gerganov
2023-03-12	Add CI (#60)	Georgi Gerganov
2023-03-12	Revert "weights_only" arg - this causing more trouble than help	Georgi Gerganov
2023-03-12	python/pytorch compat notes (#44)	Oleksandr Nikitin
2023-03-12	Add repetition penalty (#20)	beiller
2023-03-12	Clarify meaning of hacking	Georgi Gerganov
2023-03-12	README: add "Supported platforms" + update hot topics	Georgi Gerganov
2023-03-12	use weights_only in conversion script (#32)	deepdiffuser
2023-03-12	Add LICENSE (#21)	Pavol Rusnak
2023-03-12	Update README.md	Georgi Gerganov
2023-03-11	Fix a typo in model name (#16)	Juraj Bednar
2023-03-11	Update README.md	Georgi Gerganov
2023-03-11	Add AVX2 support for x86 architectures thanks to @Const-me !	Georgi Gerganov
2023-03-11	Fix un-initialized FP16 tables on x86 (#15, #2)	Georgi Gerganov
2023-03-11	Bump memory buffer	Georgi Gerganov
2023-03-11	Update README.md	Georgi Gerganov
2023-03-11	.gitignore models/	Georgi Gerganov
2023-03-11	Update Makefile var + add comment	Georgi Gerganov
2023-03-11	Update README.md	Georgi Gerganov
2023-03-11	Update README.md	Georgi Gerganov
2023-03-11	Support all LLaMA models + change Q4_0 quantization storage	Georgi Gerganov
2023-03-11	Include Python dependencies in README (#6)	Simon Willison
2023-03-11	Update README.md	Georgi Gerganov
2023-03-11	Update README.md	Georgi Gerganov
2023-03-11	Update README.md	Georgi Gerganov
2023-03-11	Add missing headers for memcpy and assert (#3)	Jean-Michaël Celerier
2023-03-11	Update README.md	Georgi Gerganov
2023-03-11	Update README.md	Georgi Gerganov
2023-03-11	Update README.md	Georgi Gerganov
2023-03-10	Update README.md	Georgi Gerganov
2023-03-10	Fix a bug in the rope calculation	Georgi Gerganov
2023-03-10	Update README.md	Georgi Gerganov