Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-03-28 | ggml : introduce structs for the q4 data blocks (#356) | Stephan Walter | |
* Introduce structs for the q4 data blocks * ggml : rename quant struct variables + fix ARM_NEON --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | |||
2023-03-22 | Deduplicate q4 quantization functions (#383) | Stephan Walter | |
* Deduplicate q4 quantization functions * Use const; add basic test * Re-enable quantization test * Disable AVX2 flags in CI --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> |