aboutsummaryrefslogtreecommitdiff
path: root/ggml-cuda.cu
diff options
context:
space:
mode:
authorKawrakow <48489457+ikawrakow@users.noreply.github.com>2023-07-14 12:46:21 +0300
committerGitHub <noreply@github.com>2023-07-14 11:46:21 +0200
commit27ad57a69b85bf12420a27e9945e580cc280be57 (patch)
treef73b384b82088c94526a80a0eef1544eee2b1df7 /ggml-cuda.cu
parent32c54116318929c90fd7ae814cf9b5232cd44c36 (diff)
Metal: faster Q4_0 and Q4_1 matrix x vector kernels (#2212)
* 3-5% faster Q4_0 on Metal * 7-25% faster Q4_1 on Metal * Oops, forgot to delete the original Q4_1 kernel --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'ggml-cuda.cu')
0 files changed, 0 insertions, 0 deletions