aboutsummaryrefslogtreecommitdiff
path: root/Makefile
AgeCommit message (Expand)Author
2023-05-28LLAMA_DEBUG adds debug symbols (#1617)Johannes Gäßler
2023-05-27Include server in releases + other build system cleanups (#1610)Kerfuffle
2023-05-26cuda : performance optimizations (#1530)Johannes Gäßler
2023-05-23OpenCL Token Generation Acceleration (#1459)0cc4m
2023-05-21make : .PHONY clean (#1553)Stefan Sydow
2023-05-20feature : support blis and other blas implementation (#1536)Zenix
2023-05-20Revert "feature : add blis and other BLAS implementation support (#1502)"Georgi Gerganov
2023-05-20feature : add blis and other BLAS implementation support (#1502)Zenix
2023-05-16Add alternate include path for openblas (#1476)sandyiscool
2023-05-13make : fix PERF build with cuBLASGeorgi Gerganov
2023-05-05makefile: automatic Arch Linux detection (#1332)DaniAndTheWeb
2023-05-05Fix for OpenCL / clbast builds on macOS. (#1329)Ionoclast Laboratories
2023-05-02Call sh on build-info.sh (#1294)DannyDaemonic
2023-05-01Add git-based build information for better issue tracking (#1232)DannyDaemonic
2023-04-30build: add armv{6,7,8} support to cmake (#1251)Pavol Rusnak
2023-04-30Various fixes to mat_mul benchmark (#1253)Stephan Walter
2023-04-29ggml : adjust mul_mat_f16 work memory (#1226)Georgi Gerganov
2023-04-29build : fix reference to old llama_util.hGeorgi Gerganov
2023-04-29cuBLAS: use host pinned memory and dequantize while copying (#1207)slaren
2023-04-28ggml : add CLBlast support (#1164)0cc4m
2023-04-28Add Manjaro CUDA include and lib dirs to Makefile (#1212)Johannes Gäßler
2023-04-24Fix cuda compilation (#1128)slaren
2023-04-23ggml : better PERF prints + support "LLAMA_PERF=1 make"Georgi Gerganov
2023-04-22ggml : fix AVX build + update to new Q8_0 formatGeorgi Gerganov
2023-04-21Improve cuBLAS performance by using a memory pool (#1094)slaren
2023-04-20Add Q4_3 support to cuBLAS (#1086)slaren
2023-04-20fix: LLAMA_CUBLAS=1 undefined reference 'shm_open' (#1080)源文雨
2023-04-20Improve cuBLAS performance by dequantizing on the GPU (#1065)slaren
2023-04-19ggml : Q4 cleanup - remove 4-bit dot product code (#1061)Stephan Walter
2023-04-19Add NVIDIA cuBLAS support (#1044)slaren
2023-04-18Adding a simple program to measure speed of dot products (#1041)Kawrakow
2023-04-15ggml : add Q8_0 quantization for intermediate results (#951)Georgi Gerganov
2023-04-14make : fix dependencies, use auto variables (#983)Stephan Walter
2023-04-13llama : merge llama_internal.h into llama.hGeorgi Gerganov
2023-04-13fix whitespace (#944)CRD716
2023-04-13benchmark : add tool for timing q4_0 matrix multiplication (#653)SebastianApel
2023-04-10Rewrite loading code to try to satisfy everyone:comex
2023-04-08Add quantize-stats command for testing quantization (#728)unbounded
2023-04-07make : add libllama.so target for llama-cpp-python (#797)bhubbb
2023-04-05make : missing host optimizations in CXXFLAGS (#763)Ivan Stepanov
2023-04-02make : use -march=native -mtune=native on x86 (#609)Fabian
2023-03-30make : fix darwin f16c flags check (#615)david raistrick
2023-03-28all : be more strict about converting float to double (#458)Stephan Walter
2023-03-28Add embedding example to Makefile (#540)RJ Adriaansen
2023-03-25Overhaul the examples structureGeorgi Gerganov
2023-03-24additional optimizations for POWER9 (#454)Cameron Kaiser
2023-03-23Fix Makefile echo escape codes (by removing them). (#418)Kerfuffle
2023-03-22Introduce C-style API (#370)Georgi Gerganov
2023-03-21makefile: Fix CPU feature detection on Haiku (#218)Alex von Gluck IV
2023-03-21Add OpenBSD support (#314)Kevin Lo