Age | Commit message (Expand) | Author |
---|---|---|
2023-04-08 | Add quantize-stats command for testing quantization (#728) | unbounded |
2023-04-02 | Added api for getting/setting the kv_cache (#685) | Christian Falch |
2023-03-30 | Make loading weights 10-100x faster | Justine Tunney |
2023-03-29 | Fix typo in llama.h (#593) | anzz1 |
2023-03-28 | llama : fix linkage with mingw (#551) | anzz1 |
2023-03-28 | all : be more strict about converting float to double (#458) | Stephan Walter |
2023-03-28 | ggml : introduce structs for the q4 data blocks (#356) | Stephan Walter |
2023-03-25 | Cleanup STL headers + fix embedding examples + minor stuff | Georgi Gerganov |
2023-03-25 | Add support for file load progress reporting callbacks (#434) | Jed Fox |
2023-03-25 | Add missing struct annotation (#483) | Doomsdayrs |
2023-03-24 | Support calling mlock() on loaded model data on Linux and macOS (#453) | comex |
2023-03-24 | Add embedding mode with arg flag. Currently working (#282) | Luciano |
2023-03-22 | Introduce C-style API (#370) | Georgi Gerganov |