author Georgi Gerganov <ggerganov@gmail.com> 2023-04-15 17:53:22 +0300
committer GitHub <noreply@github.com> 2023-04-15 17:53:22 +0300
commit e95b6554b493e71a0275764342e09bd5784a7026
tree 6b9d3e9d4eb23b64ae76f0108b409aa5825cd1b8 /Makefile
parent aa485cee334e84437e21681c14b6f80b65876d8b
ggml : add Q8_0 quantization for intermediate results (#951)
* ggml : add Q8_0 quantization for intermediate results
* quantize-stats : fix test + add it to Makefile default
* Q8: use int8_t, AVX/AVX2 optimizations
* ggml : fix quantize_row_q8_0() ARM_NEON rounding
* minor : updates after rebase to latest master
* quantize-stats : delete obsolete strings
* ggml : fix q4_1 dot func

---------

Co-authored-by: Stephan Walter <stephan@walter.name>
Diffstat (limited to 'Makefile')
-rw-r--r-- Makefile 2
1 files changed, 1 insertions, 1 deletions
diff --git a/Makefile b/Makefile
index a1b99c6..e7470d5 100644
--- a/Makefile
+++ b/Makefile
@@ -133,7 +133,7 @@ $(info I CC: $(CCV))
 $(info I CXX: $(CXXV))
 $(info )
 
-default: main quantize perplexity embedding
+default: main quantize quantize-stats perplexity embedding
 
 #
 # Build library