Age | Commit message (Expand) | Author |
---|---|---|
2023-04-10 | ggml : add ggml_cont() + optimize ggml_cpy() for contiguous dst | Georgi Gerganov |
2023-04-10 | Rewrite loading code to try to satisfy everyone: | comex |
2023-04-08 | Add quantize-stats command for testing quantization (#728) | unbounded |
2023-04-05 | ggml, llama : avoid heavy V transpose + improvements (#775) | Georgi Gerganov |
2023-04-02 | ggml : change ne to int64_t (#626) | Marian Cepok |
2023-03-30 | Ensure --mlock works properly with mmap() support | Justine Tunney |
2023-03-30 | Add mmap support for model files | Slaren |
2023-03-28 | ggml : introduce structs for the q4 data blocks (#356) | Stephan Walter |
2023-03-24 | Support calling mlock() on loaded model data on Linux and macOS (#453) | comex |
2023-03-22 | Deduplicate q4 quantization functions (#383) | Stephan Walter |
2023-03-22 | Introduce C-style API (#370) | Georgi Gerganov |
2023-03-16 | Add RMS norm and use it (#187) | hoangmit |
2023-03-10 | Initial release | Georgi Gerganov |