aboutsummaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2023-04-17quantize-stats : fix bug in --type argumentGeorgi Gerganov
2023-04-17ggml : avoid using ggml_fp16_to_fp32() and ggml_fp32_to_fp16() in ggml.cGeorgi Gerganov
2023-04-17Speedup the AVX-512 implementation of ggml_vec_dot_q4_0() (#933)Ivan Komarov
2023-04-16Fix: do not close file on mmap (#1017)slaren
2023-04-16stdout : vertical align outputs for better readibilityGeorgi Gerganov
2023-04-16examples: add missing <ctime> include for time() (#1011)Pavol Rusnak
2023-04-16Fix msys2 build error and warnings (#1009)nanahi
2023-04-15convert.py: Fix loading safetensors and ggml format on Windows (#991)comex
2023-04-15Fix potential int8 overflow in non-SIMD vec_dot (#986)Stephan Walter
2023-04-15Refactor ggml.c for future tensor types (#1001)Stephan Walter
2023-04-15ggml : add Q8_0 quantization for intermediate results (#951)Georgi Gerganov
2023-04-15ggml : use posix_memalign on non-Windows envGeorgi Gerganov
2023-04-15benchmark : fix result validation in benchmark-q4_0-matmult (#987)Ivan Komarov
2023-04-15cmake : add finding the OpenBLAS header file (#992)katsu560
2023-04-14Revert "main : alternative instruct mode (Vicuna support, etc.) (#863)" (#982)Pavol Rusnak
2023-04-14py : bump sentencepiece to 0.1.98 to support Python 3.11 (#976)Pavol Rusnak
2023-04-14make : fix dependencies, use auto variables (#983)Stephan Walter
2023-04-14Expose type name from ggml (#970)Pavol Rusnak
2023-04-14main : alternative instruct mode (Vicuna support, etc.) (#863)Tomáš Pazdiora
2023-04-14ggml : add unary and binary map operations (#874)Kerfuffle
2023-04-14py : cleanup dependencies (#962)Pavol Rusnak
2023-04-14py : fix flake8 and isort nitpicks (#960)Pavol Rusnak
2023-04-14ggml : minorGeorgi Gerganov
2023-04-14ggml : always allocate buffers with size multiple of GGML_MEM_ALIGNGeorgi Gerganov
2023-04-14py : new conversion script (#545)comex
2023-04-14ggml : fix q4_1 dot product typesGeorgi Gerganov
2023-04-14ggml : optimize rope function to avoid call powf in the tight loop (#807)Howard Su
2023-04-14perplexity : add support for batch size to `--perplexity` (#407)Gary Linscott
2023-04-13common : remove unnecessary includes (#947)CRD716
2023-04-13ggml : add GGML_DEFAULT_N_THREADSGeorgi Gerganov
2023-04-13ggml : speed-up ggml_vec_dot_q4_1() ARM_NEON + 32-bit ARM support (#900)Georgi Gerganov
2023-04-13llama : merge llama_internal.h into llama.hGeorgi Gerganov
2023-04-13gitignore : benchmarkGeorgi Gerganov
2023-04-13ggml : optimize non-SIMD Q4_0 vector dot product (#703)Stephan Walter
2023-04-13ggml : introduce GGML_ALIGNED_MALLOC/GGML_ALIGNED_FREE macros (#884)Pavol Rusnak
2023-04-13fix whitespace (#944)CRD716
2023-04-13readme : remove python 3.10 warning (#929)CRD716
2023-04-13readme : llama node binding (#911)Genkagaku.GPT
2023-04-13flake.nix: add all binaries from bin (#848)Pavol Rusnak
2023-04-13zig : update build.zig (#872)Judd
2023-04-13ggml : update cblas_sgemm columns var to be more reasonable (#838)Vladimir
2023-04-13examples : add -n to alpaca and gpt4all scripts (#706)niansa/tuxifan
2023-04-13cmake : add explicit F16C option (x86) (#576)anzz1
2023-04-13benchmark : add tool for timing q4_0 matrix multiplication (#653)SebastianApel
2023-04-13do not force the prompt file to end with a new line (#908)Pavol Rusnak
2023-04-12Don't crash on ftype (formerly f16) == 4 (#917)Stephan Walter
2023-04-12readme : change "GPU support" link to discussionGeorgi Gerganov
2023-04-12readme : update hot topics with link to "GPU support" issueGeorgi Gerganov
2023-04-12readme: link to sha256sums file (#902)Nicolai Weitkemper
2023-04-11Fix whitespace, add .editorconfig, add GitHub workflow (#883)Pavol Rusnak