Age | Commit message | Author | |
---|---|---|---|
2023-03-30 | Initial windows support (untested) | Slaren | |
2023-03-30 | Always initialize mm_addr and mm_length in llama_model | Slaren | |
2023-03-30 | Unmap the file in llama_free | Slaren | |
2023-03-30 | Make mmap_file static | Slaren | |
2023-03-30 | Fix ggml_init_params in quantize | Slaren | |
2023-03-30 | Add mmap support for model files | Slaren | |
2023-03-30 | cmake : properly invoke CTest (#629) | Stephan Walter | |
2023-03-30 | Remove unused variable (#607) | Casey Primozic | |
* It seems some new warnings were added recently that exposed this. I wrote the code that included this unused variable originally and it is indeed not needed. | |||
2023-03-30 | make : fix darwin f16c flags check (#615) | david raistrick | |
...there was no check. Ported upstream from https://github.com/zanussbaum/gpt4all.cpp/pull/2 (I don't see any clean path for upstream patches) | |||
2023-03-30 | ggml : fix NEON signs (close #620, #622) | Georgi Gerganov | |
2023-03-30 | Fix GGML_F32Cx8_STORE in AVX without F16C path (#619) | slaren | |
2023-03-29 | ci : re-enable AVX512 testing (Windows-MSVC) (#584) | anzz1 | |
* CI: Re-enable AVX512 testing (Windows-MSVC) Now with 100% less base64 encoding * plain __cpuid is enough here | |||
2023-03-29 | ggml : init time on first ggml_init() call | Georgi Gerganov | |
2023-03-29 | llama : fix compile warnings when reading the vocab | Georgi Gerganov | |
2023-03-29 | ggml : add ARM_NEON dequantize_row_q4_1() | Georgi Gerganov | |
2023-03-29 | ggml : add ARM_NEON quantize_row_q4_1() | Georgi Gerganov | |
2023-03-29 | ggml : add ARM_NEON ggml_vec_dot_q4_1() | Georgi Gerganov | |
2023-03-29 | rename convert_ggml_to_pth.py -> convert-ggml-to-pth.py (#600) | Pavol Rusnak | |
to match filenames of other converters | |||
2023-03-29 | Create chat-13B.bat (#592) | Thérence | |
* Create chat-13B.bat Same script as chat-13B.sh, but for Windows users. Tested and working on Windows 10/11 v 22H2 * Apply suggestions from code review --------- Co-authored-by: anzz1 <anzz1@live.com> | |||
2023-03-29 | readme : fix typos | Georgi Gerganov | |
2023-03-29 | readme : add GPT4All instructions (close #588) | Georgi Gerganov | |
2023-03-29 | py : add GPT4All conversion script | Georgi Gerganov | |
For now: copy-paste. Too much time for me to deduplicate the Python code | |||
2023-03-29 | llama : use the same threshold for OpenBLAS and ggml thread limiting (#577) | Maël Kerbiriou | |
2023-03-29 | add example of re-act pattern (#583) | Tobias Lütke | |
* add example of re-act pattern * spelling... * fixed whitespace in reverse prompt issue | |||
2023-03-29 | Fix GCC warning about binary literal (#595) | anzz1 | |
0b10101010 -> 0xAA /* 0b10101010 */ | |||
2023-03-29 | Fix typo in llama.h (#593) | anzz1 | |
2023-03-28 | Enable Fused-Multiply-Add (FMA) and F16C/CVT16 vector extensions on MSVC (#375) | anzz1 | |
* Enable Fused-Multiply-Add (FMA) instructions on MSVC __FMA__ macro does not exist in MSVC * Enable F16C/CVT16 vector extensions on MSVC __F16C__ macro does not exist in MSVC, but is implied with AVX2/AVX512 * MSVC cvt intrinsics * Add __SSE3__ macro for MSVC too because why not even though it's not currently used for anything when AVX is defined | |||
2023-03-28 | CI: fix subdirectory path globbing (#546) | anzz1 | |
- Changes in subdirectories will now be detected properly - (Windows-MSVC) AVX512 tests temporarily disabled | |||
2023-03-28 | llama : fix linkage with mingw (#551) | anzz1 | |
* Revert 7e53955 (#542) Still needs to be fixed properly * Fix linking on mingw32 | |||
2023-03-28 | ggml : add AVX2 implementation of quantize_row_q4_1 (#515) | slaren | |
* Add AVX2 implementation of quantize_row_q4_1 * Actually use AVX2 * Make quantize_row_q4_1 static --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | |||
2023-03-28 | py : add temporary script to convert old ggml files to newer version (#539) | thement | |
Co-authored-by: Jakub Horak <jakub.horak@ibawizard.net> | |||
2023-03-28 | py : add capability to convert from ggml back to torch or hf format for further consumption/training/finetuning (#403) | Tai Duc Nguyen | |
2023-03-28 | ggml : refactor quantized processing functions (#509) | Stephan Walter | |
* Refactor quantized processing functions * ggml : minor --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | |||
2023-03-28 | py : removed unused `model` variable and verified that the code functions correctly with `vocab_only` setting. Also confirmed that the code works as expected after running with reduced memory usage due to deletion of no-longer-needed variable. (#547) | DooWoong Lee (David) | |
2023-03-28 | ci : make ctest verbose, hopefully we see what is wrong with the sanitizer | Georgi Gerganov | |
2023-03-28 | tests : free llama context at the end of the test | Georgi Gerganov | |
2023-03-28 | all : be more strict about converting float to double (#458) | Stephan Walter | |
* Be more strict about converting float to double * Test equivalence of round, SILU implementations Test module is commented out in CMakeLists.txt because the tests may take a long time, depending on how much the compiler optimizes. * Fix softmax in perplexity.cpp * all : prefer float over double where appropriate * perplexity : add <cmath> --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | |||
2023-03-28 | deploy : add a Package.swift for SwiftPM support (#393) | Jed Fox | |
* Add a Package.swift for SwiftPM support * Swap from exclusions to allowlist | |||
2023-03-28 | ggml : introduce structs for the q4 data blocks (#356) | Stephan Walter | |
* Introduce structs for the q4 data blocks * ggml : rename quant struct variables + fix ARM_NEON --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | |||
2023-03-28 | gitignore : add "embedding" | Georgi Gerganov | |
2023-03-28 | Check the existence of f16_model_path_base in quantize.py (#574) | dotpy314 | |
Co-authored-by: Jincheng Miao <jincheng.miao@gmail.com> | |||
2023-03-28 | Fix usage of F16C intrinsics in AVX code (#563) | slaren | |
* Fix usage of F16C intrinsics in AVX code when F16C is not defined | |||
2023-03-28 | main.cpp fixes, refactoring (#571) | anzz1 | |
- main: entering empty line passes back control without new input in interactive/instruct modes - instruct mode: keep prompt fix - instruct mode: duplicate instruct prompt fix - refactor: move common console code from main->common | |||
2023-03-28 | Add embedding example to Makefile (#540) | RJ Adriaansen | |
2023-03-27 | Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542) | Marco Matthies | |
2023-03-26 | ci: add debug build to sanitizer build matrix (#527) | Erik Scholz | |
2023-03-26 | Fix undefined variables in debug build, remove unused variables (#531) | Stephan Walter | |
2023-03-26 | Add support for linux/arm64 platform during Docker Builds (#514) | Juan Calderon-Perez | |
* Add support for linux/arm64 platform * Add platform to versioned builds | |||
2023-03-26 | Update README and comments for standalone perplexity tool (#525) | Stephan Walter | |
2023-03-26 | [main] fix infinite generation (-n == -1) (#523) | anzz1 | |
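
The "Fix GCC warning about binary literal (#595)" entry above replaces `0b10101010` with `0xAA /* 0b10101010 */`. Binary literals are a GNU extension in C before C23, so GCC can warn about them in pedantic builds, while the hex spelling is portable. The sketch below only illustrates that the two spellings denote the same value. `parse_binary` and `MASK_ALTERNATING` are hypothetical names introduced for this example and are not taken from the repository.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of the repository): converts a string of
 * '0'/'1' characters into its integer value, most significant bit first.
 * Any character other than '1' is treated as a 0 bit. */
static uint32_t parse_binary(const char *bits) {
    uint32_t value = 0;
    for (size_t i = 0; bits[i] != '\0'; ++i) {
        value = (value << 1) | (uint32_t)(bits[i] == '1');
    }
    return value;
}

/* Portable spelling in the style of the commit: a hex literal, with the
 * binary form kept in a comment for readability. The macro name is an
 * assumption made for this sketch. */
#define MASK_ALTERNATING 0xAA /* 0b10101010 */
```

Calling `parse_binary("10101010")` and comparing the result with `MASK_ALTERNATING` is one way to confirm that a hex replacement matches the bit pattern it was meant to encode.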