Age | Commit message | Author
2023-03-30 | Introduce GGML migration tool for new file format | Justine Tunney
2023-03-30 | Ensure --mlock works properly with mmap() support | Justine Tunney
2023-03-30 | Make loading weights 10-100x faster | Justine Tunney
2023-03-30 | Initial windows support (untested) | Slaren
2023-03-30 | Always initialize mm_addr and mm_length in llama_model | Slaren
2023-03-30 | Unmap the file in llama_free | Slaren
2023-03-30 | Make mmap_file static | Slaren
2023-03-30 | Fix ggml_init_params in quantize | Slaren
2023-03-30 | Add mmap support for model files | Slaren
2023-03-30 | cmake : properly invoke CTest (#629) | Stephan Walter
2023-03-30 | Remove unused variable (#607) | Casey Primozic
2023-03-30 | make : fix darwin f16c flags check (#615) | david raistrick
2023-03-30 | ggml : fix NEON signs (close #620, #622) | Georgi Gerganov
2023-03-30 | Fix GGML_F32Cx8_STORE in AVX without F16C path (#619) | slaren
2023-03-29 | ci : re-enable AVX512 testing (Windows-MSVC) (#584) | anzz1
2023-03-29 | ggml : init time on first ggml_init() call | Georgi Gerganov
2023-03-29 | llama : fix compile warnings when reading the vocab | Georgi Gerganov
2023-03-29 | ggml : add ARM_NEON dequantize_row_q4_1() | Georgi Gerganov
2023-03-29 | ggml : add ARM_NEON quantize_row_q4_1() | Georgi Gerganov
2023-03-29 | ggml : add ARM_NEON ggml_vec_dot_q4_1() | Georgi Gerganov
2023-03-29 | rename convert_ggml_to_pth.py -> convert-ggml-to-pth.py (#600) | Pavol Rusnak
2023-03-29 | Create chat-13B.bat (#592) | Thérence
2023-03-29 | readme : fix typos | Georgi Gerganov
2023-03-29 | readme : add GPT4All instructions (close #588) | Georgi Gerganov
2023-03-29 | py : add GPT4All conversion script | Georgi Gerganov
2023-03-29 | llama : use the same threshold for OpenBLAS and ggml thread limiting (#577) | Maël Kerbiriou
2023-03-29 | add example of re-act pattern (#583) | Tobias Lütke
2023-03-29 | Fix GCC warning about binary literal (#595) | anzz1
2023-03-29 | Fix typo in llama.h (#593) | anzz1
2023-03-28 | Enable Fused-Multiply-Add (FMA) and F16C/CVT16 vector extensions on MSVC (#375) | anzz1
2023-03-28 | CI: fix subdirectory path globbing (#546) | anzz1
2023-03-28 | llama : fix linkage with mingw (#551) | anzz1
2023-03-28 | ggml : add AVX2 implementation of quantize_row_q4_1 (#515) | slaren
2023-03-28 | py : add temporary script to convert old ggml files to newer version (#539) | thement
2023-03-28 | py : add capabiliy to convert from ggml back to torch or hf format for furthe... | Tai Duc Nguyen
2023-03-28 | ggml : refactor quantized processing functions (#509) | Stephan Walter
2023-03-28 | py : removed unused `model` variable and verified that the code functions cor... | DooWoong Lee (David)
2023-03-28 | ci : make ctest verbose, hopefully we see what is wrong with the sanitizer | Georgi Gerganov
2023-03-28 | tests : free llama context at the end of the test | Georgi Gerganov
2023-03-28 | all : be more strict about converting float to double (#458) | Stephan Walter
2023-03-28 | deploy : add a Package.swift for SwiftPM support (#393) | Jed Fox
2023-03-28 | ggml : introduce structs for the q4 data blocks (#356) | Stephan Walter
2023-03-28 | gitignore : add "embedding" | Georgi Gerganov
2023-03-28 | Check the existence of f16_model_path_base in quantize.py (#574) | dotpy314
2023-03-28 | Fix usage of F16C intrinsics in AVX code (#563) | slaren
2023-03-28 | main.cpp fixes, refactoring (#571) | anzz1
2023-03-28 | Add embedding example to Makefile (#540) | RJ Adriaansen
2023-03-27 | Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542) | Marco Matthies
2023-03-26 | ci: add debug build to sanitizer build matrix (#527) | Erik Scholz
2023-03-26 | Fix undefined variables in debug build, remove unused variables (#531) | Stephan Walter