aboutsummaryrefslogtreecommitdiff
path: root/quantize.cpp
diff options
context:
space:
mode:
authorGeorgi Gerganov <ggerganov@gmail.com>2023-03-25 19:47:21 +0200
committerGitHub <noreply@github.com>2023-03-25 19:47:21 +0200
commitecbe466a364876927994e2f1ec14f4d82301d201 (patch)
tree618c3e4e3f178f20bdc219e80b455ff45381ada5 /quantize.cpp
parent502a400192013d3e95ed87b777e8fa3bec45713c (diff)
Retire the ggml_mul_mat() branch for transposed src0 (#500)
* Retire the ggml_mul_mat() for transposed src0 - It can always be made contiguous with ggml_cpy() - The code is now simplified - The results are deterministic in respect to num threads * SIMD-ify dequantize_row_q4_0() for ARM_NEON (#502) * Attempt to SIMD-ify dequantize_row_q4_0() for ARM_NEON * Fix dequantization - forgot to interleave the quants
Diffstat (limited to 'quantize.cpp')
0 files changed, 0 insertions, 0 deletions