llama.cpp.git - llama.cpp

Age	Commit message (Expand)	Author
2023-08-02	readme : add Aquila-7B model series to supported models (#2487)	ldwang
2023-08-02	readme : Add Chinese LLaMA-2 / Alpaca-2 to supported models (#2475)	Yiming Cui
2023-07-31	CUDA: mmq CLI option, fixed mmq build issues (#2453)	Johannes Gäßler
2023-07-29	CUDA: Quantized matrix matrix multiplication (#2160)	Johannes Gäßler
2023-07-28	Obtaining LLaMA 2 instructions (#2308)	niansa/tuxifan
2023-07-23	Fix __dp4a documentation (#2348)	Johannes Gäßler
2023-07-23	make : fix CLBLAST compile support in FreeBSD (#2331)	Jose Maldonado
2023-07-21	flake : remove intel mkl from flake.nix due to missing files (#2277)	wzy
2023-07-19	flake : update flake.nix (#2270)	wzy
2023-07-16	py : turn verify-checksum-models.py into executable (#2245)	Jiří Podivín
2023-07-11	readme : fix zig build instructions (#2171)	Chad Brewbaker
2023-07-10	mpi : add support for distributed inference via MPI (#2099)	Evan Miller
2023-07-09	readme : update Termux instructions (#2147)	JackJollimore
2023-07-09	readme : add more docs indexes (#2127)	rankaiyx
2023-07-07	docker : add support for CUDA in docker (#1461)	dylan
2023-07-06	convert : update for baichuan (#2081)	Judd
2023-07-05	Quantized dot products for CUDA mul mat vec (#2067)	Johannes Gäßler
2023-07-04	readme : add link web chat PR	Georgi Gerganov
2023-07-01	convert : add support of baichuan-7b (#2055)	Judd
2023-06-26	readme : add Scala 3 bindings repo (#2010)	Roman Parykin
2023-06-26	readme : LD_LIBRARY_PATH complement for some Android devices when building wi...	Gustavo Rocha Dias
2023-06-26	readme : add link to new k-quants for visibility	Georgi Gerganov
2023-06-25	readme : add new roadmap + manifesto	Georgi Gerganov
2023-06-25	readme : add Azure CI discussion link	Georgi Gerganov
2023-06-24	readme : fix whitespaces	Georgi Gerganov
2023-06-24	readme : fixed termux instructions (#1973)	Alberto
2023-06-23	Add OpenLLaMA instructions to the README (#1954)	eiery
2023-06-21	Fix typo in README.md (#1961)	Rahul Vivek Nair
2023-06-20	readme : add link to p1	Georgi Gerganov
2023-06-20	Fix typo (#1949)	Xiake Sun
2023-06-19	Convert vector to f16 for dequantize mul mat vec (#1913)	Johannes Gäßler
2023-06-18	readme : update Android build instructions (#1922)	Mike
2023-06-17	Only one CUDA stream per device for async compute (#1898)	Johannes Gäßler
2023-06-17	readme : alternative way to build for Android with CLBlast. (#1828)	Gustavo Rocha Dias
2023-06-10	doc : fix wrong address of BLIS.md (#1772)	Aisuko
2023-06-07	readme : add June roadmap	Georgi Gerganov
2023-06-05	docs : add performance troubleshoot + example benchmark documentation (#1674)	Yuval Peled
2023-06-05	readme : fix typo (#1700)	Foul-Tarnished
2023-06-04	readme : update hot topics	Georgi Gerganov
2023-06-04	llama : Metal inference (#1642)	Georgi Gerganov
2023-06-03	Add info about CUDA_VISIBLE_DEVICES (#1682)	Henri Vasserman
2023-05-27	Add documentation about CLBlast (#1604)	Henri Vasserman
2023-05-24	readme : add docs for chat-persistent.sh (#1568)	Evan Jones
2023-05-20	feature : support blis and other blas implementation (#1536)	Zenix
2023-05-20	Revert "feature : add blis and other BLAS implementation support (#1502)"	Georgi Gerganov
2023-05-20	feature : add blis and other BLAS implementation support (#1502)	Zenix
2023-05-19	ggml : use F16 instead of F32 in Q4_0, Q4_1, Q8_0 (#1508)	Georgi Gerganov
2023-05-19	readme : adds WizardLM to the list of supported models (#1485)	David Kennedy
2023-05-13	readme : update Q4_0 perplexities	Georgi Gerganov
2023-05-12	readme : add C#/.NET bindings repo (#1409)	Rinne