aboutsummaryrefslogtreecommitdiff
path: root/ggml-metal.metal
AgeCommit message (Expand)Author
2023-07-21Faster Q3_K implementation on Metal (#2307)Kawrakow
2023-07-21Faster Q2_K on Metal (#2297)Kawrakow
2023-07-20Faster Q5_K and Q6_K on Metal (#2294)Kawrakow
2023-07-20Faster Q4_K on Metal (#2290)Kawrakow
2023-07-20metal: minor q4 optimization and reduce code size (#2248)Shouzheng Liu
2023-07-15llama : add custom RoPE (#2054)Xiao-Yong Jin
2023-07-14Metal: faster Q4_0 and Q4_1 matrix x vector kernels (#2212)Kawrakow
2023-07-12metal : new q4_0 matrix-vector kernel (#2188)Shouzheng Liu
2023-06-26k-quants : support for super-block size of 64 (#2001)Kawrakow
2023-06-17metal : add norm, cpy f16->f16, alibi kernels (#1823)Aaron Miller
2023-06-12Metal implementation for all k_quants (#1807)Kawrakow
2023-06-10metal : add Q4_1 implementation (#1785)Kawrakow
2023-06-09metal : fix build "tanhf" -> "tanh"Georgi Gerganov
2023-06-09metal : add GELU implementation (#1770)AT
2023-06-09metal : faster q4_0 (#1775)Kawrakow
2023-06-08metal : add Q2_K implementation (#1762)Kawrakow
2023-06-08metal : Q6_K implementation (#1752)Kawrakow
2023-06-08metal : add Q4_K implementation (#1733)Kawrakow
2023-06-06metal : add f16 supportGeorgi Gerganov
2023-06-04llama : Metal inference (#1642)Georgi Gerganov