llama.cpp.git - llama.cpp

Age	Commit message	Author
2023-03-24	Revert "Fix memory allocation issues and seg faults"	Georgi Gerganov
	This reverts commit 4870e455b3653f7d7769fa5772b2c90ffad088df. Will provide the correct fix later.
2023-03-24	Fix memory allocation issues and seg faults	Georgi Gerganov

2023-03-23	Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439)	Georgi Gerganov
	Should make results reproducible for different numbers of threads and batch sizes.
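	One plausible reason results can vary with thread count is that floating-point addition is not associative, so the same dot product can change when it is accumulated in a different order. This is an assumption about the motivation, not something the commit message states. The sketch below is not taken from ggml or llama.cpp; `sum_sequential` and `sum_chunked` are hypothetical helpers that only illustrate the effect.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: accumulate left to right in a single pass.
inline float sum_sequential(const std::vector<float>& v) {
    float acc = 0.0f;
    for (float x : v) {
        acc += x;
    }
    return acc;
}

// Hypothetical helper: split the input into `chunks` contiguous ranges,
// sum each range separately, then combine the partial sums. This mimics
// how a reduction might be divided among worker threads.
inline float sum_chunked(const std::vector<float>& v, std::size_t chunks) {
    const std::size_t n = v.size();
    float total = 0.0f;
    for (std::size_t c = 0; c < chunks; ++c) {
        const std::size_t begin = n * c / chunks;
        const std::size_t end = n * (c + 1) / chunks;
        float partial = 0.0f;
        for (std::size_t i = begin; i < end; ++i) {
            partial += v[i];
        }
        total += partial;
    }
    return total;
}
```

	With an input such as {1e8f, 1.0f, -1e8f, 1.0f}, the sequential sum and the two-chunk sum can differ, because the small terms are rounded away at different points in the accumulation.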
2023-03-22	Add missing header for memcpy (#386)	Yusuf Kağan Hanoğlu
	fixed: memcpy is not defined
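	For context, `memcpy` is declared in `<cstring>` (`<string.h>` in C). Code that calls it without that include may compile only when another header happens to pull it in. The file this commit touched is not shown here, so the sketch below is a generic illustration; `float_bits` is a hypothetical helper, not a llama.cpp function.

```cpp
#include <cstdint>
#include <cstring>  // declares std::memcpy

// Hypothetical helper: copy the raw bytes of a float into a 32-bit integer.
// Assumes float is 32 bits wide, which the static_assert checks.
inline std::uint32_t float_bits(float value) {
    static_assert(sizeof(std::uint32_t) == sizeof(float), "float must be 32 bits");
    std::uint32_t bits = 0;
    std::memcpy(&bits, &value, sizeof(bits));
    return bits;
}
```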
2023-03-22	Init llama_context_params properly from CLI (#370)	Georgi Gerganov

2023-03-22	Introduce C-style API (#370)	Georgi Gerganov
	* Major refactoring - introduce C-style API
	* Clean up
	* Add <cassert>
	* Add <iterator>
	* Add <algorithm>
	....
	* Fix timing reporting and accumulation
	* Measure eval time only for single-token calls
	* Change llama_tokenize return meaning