Age | Commit message | Author | |
---|---|---|---|
2023-03-24 | Revert "Fix memory allocation issues and seg faults" | Georgi Gerganov | |
This reverts commit 4870e455b3653f7d7769fa5772b2c90ffad088df. Will provide the correct fix later | |||
2023-03-24 | Fix memory allocation issues and seg faults | Georgi Gerganov | |
2023-03-23 | Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439) | Georgi Gerganov | |
Should make results reproducible for different number of threads and batch sizes | |||
2023-03-22 | Add missing header for memcpy (#386) | Yusuf Kağan Hanoğlu | |
fixed: memcpy is not defined | |||
2023-03-22 | Init llama_context_params properly from CLI (#370) | Georgi Gerganov | |
2023-03-22 | Introduce C-style API (#370) | Georgi Gerganov | |
* Major refactoring - introduce C-style API * Clean up * Add `<cassert>` * Add `<iterator>` * Add `<algorithm>` .... * Fix timing reporting and accumulation * Measure eval time only for single-token calls * Change llama_tokenize return meaning | |||