Age | Commit message (Expand) | Author |
---|---|---|
2023-06-17 | metal : add norm, cpy f16->f16, alibi kernels (#1823) | Aaron Miller |
2023-06-15 | metal : parallel command buffer encoding (#1860) | Georgi Gerganov |
2023-06-12 | Metal implementation for all k_quants (#1807) | Kawrakow |
2023-06-12 | metal : fix failure to load model (#1817) | Kawrakow |
2023-06-10 | metal : fix issue with ggml-metal.metal path. Closes #1769 (#1782) | Andrei |
2023-06-10 | metal : add Q4_1 implementation (#1785) | Kawrakow |
2023-06-09 | metal : add GELU implementation (#1770) | AT |
2023-06-09 | metal : faster q4_0 (#1775) | Kawrakow |
2023-06-08 | metal : add Q2_K implementation (#1762) | Kawrakow |
2023-06-08 | metal : Q6_K implementation (#1752) | Kawrakow |
2023-06-08 | metal : add Q4_K implementation (#1733) | Kawrakow |
2023-06-06 | metal : add f16 support | Georgi Gerganov |
2023-06-06 | metal : add checks for buffer size (#1706) | Spencer Sutton |
2023-06-05 | metal : use shared buffers between CPU and GPU (#1696) | kiltyj |
2023-06-04 | llama : Metal inference (#1642) | Georgi Gerganov |