llama.cpp.git - llama.cpp

Age	Commit message	Author
2023-06-03	Docker: change to calling convert.py (#1641)	Jiří Podivín
	Deprecation disclaimer was added to convert-pth-to-ggml.py
2023-04-26	quantize : use `map` to assign quantization type from `string` (#1191)	Pavol Rusnak
	instead of `int` (the `int` option is still supported). This allows the following usage: `./quantize ggml-model-f16.bin ggml-model-q4_0.bin q4_0` instead of: `./quantize ggml-model-f16.bin ggml-model-q4_0.bin 2`
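	The name-to-type lookup described in this commit could look roughly like the sketch below. It is an illustration, not the code from the commit: the function name `parse_quant_type` and the `q4_1` entry are assumptions, and only the `q4_0` = `2` pairing comes from the commit message.

```cpp
#include <cstddef>
#include <exception>
#include <map>
#include <optional>
#include <string>

// Name -> numeric quantization type. Only "q4_0" -> 2 is taken from the
// commit message; the "q4_1" entry is an assumption added for illustration.
static const std::map<std::string, int> QUANT_TYPE_BY_NAME = {
    {"q4_0", 2},
    {"q4_1", 3},  // assumed value
};

// Hypothetical helper: accept either a type name ("q4_0") or the legacy
// integer form ("2"). Returns std::nullopt for anything unrecognized.
std::optional<int> parse_quant_type(const std::string & arg) {
    // Preferred path: look the string up in the map.
    const auto it = QUANT_TYPE_BY_NAME.find(arg);
    if (it != QUANT_TYPE_BY_NAME.end()) {
        return it->second;
    }
    // Legacy path: the argument may still be a plain integer.
    try {
        std::size_t pos = 0;
        const int value = std::stoi(arg, &pos);
        if (pos != arg.size()) {
            return std::nullopt;  // trailing garbage such as "2x"
        }
        // Accept the integer only if it names a known type.
        for (const auto & entry : QUANT_TYPE_BY_NAME) {
            if (entry.second == value) {
                return value;
            }
        }
    } catch (const std::exception &) {
        // std::stoi throws on non-numeric or out-of-range input.
    }
    return std::nullopt;
}
```

	With this sketch, `parse_quant_type("q4_0")` and `parse_quant_type("2")` resolve to the same type, so existing invocations that pass an integer keep working.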
2023-03-23	Remove obsolete command from Docker script	Georgi Gerganov

2023-03-17	Don't tell users to use a bad number of threads (#243)	Stephan Walter
	The readme tells people to use the command line option "-t 8", causing 8 threads to be started. On systems with fewer than 8 cores, this causes a significant slowdown. Remove the option from the example command lines and use /proc/cpuinfo on Linux to determine a sensible default.
2023-03-17	🚀 Dockerize llamacpp (#132)	Bernat Vadell
	* feat: dockerize llamacpp
	* feat: split build & runtime stages
	* split dockerfile into main & tools
	* add quantize into tool docker image
	* Update .devops/tools.sh
	  Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
	* add docker action pipeline
	* change CI to publish at github docker registry
	* fix name runs-on macOS-latest is macos-latest (lowercase)
	* include docker versioned images
	* fix github action docker
	* fix docker.yml
	* feat: include all-in-one command tool & update readme.md
	---------
	Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>