author | Georgi Gerganov <ggerganov@gmail.com> | 2023-03-22 07:32:36 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-03-22 07:32:36 +0200 |
commit | f5a77a629bd0f37ae1696747633ab42a5530ec15 (patch) | |
tree | b3d147dd228ce67661ed497a6dc61b444a38e0f9 /ggml.h | |
parent | da0e9fe90ccf6e73597eb19dd0cfc0a28363fb3b (diff) |
Introduce C-style API (#370)
* Major refactoring - introduce C-style API
* Clean up
* Add <cassert>
* Add <iterator>
* Add <algorithm> ....
* Fix timing reporting and accumulation
* Measure eval time only for single-token calls
* Change llama_tokenize return meaning
Diffstat (limited to 'ggml.h')
-rw-r--r-- | ggml.h | 7 |
1 files changed, 7 insertions, 0 deletions
@@ -742,6 +742,13 @@ enum ggml_opt_result ggml_opt(
         struct ggml_tensor * f);
 
 //
+// quantization
+//
+
+size_t ggml_quantize_q4_0(float * src, void * dst, int n, int k, int qk, int64_t * hist);
+size_t ggml_quantize_q4_1(float * src, void * dst, int n, int k, int qk, int64_t * hist);
+
+//
 // system info
 //
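The hunk above only declares the two quantization entry points; their bodies are not part of this diff. As a rough illustration of what 4-bit block quantization with a single per-block scale can look like, here is a minimal sketch. It is not ggml's implementation. The helper names (`sketch_round`, `sketch_quantize_block_q4`, `sketch_dequantize_q4`), the fixed block size `SKETCH_QK`, the symmetric `[-7, 7]` code range, the nibble packing order, and the 16-bucket histogram are all assumptions made for the example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed block size for this sketch; the declared functions take it as `qk`. */
#define SKETCH_QK 32

/* Round half away from zero without depending on libm. */
static int sketch_round(float x) {
    return (int)(x >= 0.0f ? x + 0.5f : x - 0.5f);
}

/* Hypothetical helper, not part of ggml: quantize one block of SKETCH_QK
 * floats into SKETCH_QK/2 bytes (two 4-bit codes per byte) and return the
 * per-block scale. If `hist` is non-NULL it must have 16 entries and is
 * incremented once per emitted code. */
static float sketch_quantize_block_q4(const float *src, uint8_t *dst, int64_t *hist) {
    /* Largest magnitude in the block determines the scale. */
    float amax = 0.0f;
    for (int i = 0; i < SKETCH_QK; i++) {
        const float a = src[i] < 0.0f ? -src[i] : src[i];
        if (a > amax) amax = a;
    }

    const float d  = amax / 7.0f;                    /* map [-amax, amax] onto [-7, 7] */
    const float id = d != 0.0f ? 1.0f / d : 0.0f;    /* all-zero block: avoid dividing by 0 */

    for (int i = 0; i < SKETCH_QK; i += 2) {
        /* Shift by 8 so each code is an unsigned value in [1, 15]. */
        const uint8_t q0 = (uint8_t)(sketch_round(src[i]     * id) + 8);
        const uint8_t q1 = (uint8_t)(sketch_round(src[i + 1] * id) + 8);
        assert(q0 < 16 && q1 < 16);

        dst[i / 2] = (uint8_t)(q0 | (q1 << 4));      /* low nibble first (assumed order) */

        if (hist != NULL) {
            hist[q0]++;
            hist[q1]++;
        }
    }
    return d;
}

/* Hypothetical helper: recover element `i` of a block from its packed codes. */
static float sketch_dequantize_q4(const uint8_t *dst, float d, int i) {
    const uint8_t byte = dst[i / 2];
    const int q = (i % 2 == 0) ? (byte & 0x0F) : (byte >> 4);
    return (float)(q - 8) * d;
}
```

With rounding to the nearest code, each reconstructed value in this sketch differs from its input by at most about half the block scale. Storing one scale per block keeps the overhead small, at the cost of letting a single outlier coarsen the whole block.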