Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-03-13 | Revert "10% performance boost on ARM" | Georgi Gerganov | |
This reverts commit 113a9e83ebc0f788f861394437087bf3ca0e019b. There are some reports for illegal instruction. Moved this stuff to vdotq_s32 branch until resolve | |||
2023-03-13 | Check for vdotq_s32 availability | Georgi Gerganov | |
2023-03-13 | Ammend to previous commit - forgot to update non-QRDMX branch | Georgi Gerganov | |
2023-03-13 | 10% performance boost on ARM | Georgi Gerganov | |
2023-03-12 | Windows fixes (#31) | Sebastián A | |
* Apply fixes suggested to build on windows Issue: https://github.com/ggerganov/llama.cpp/issues/22 * Remove unsupported VLAs * MSVC: Remove features that are only available on MSVC C++20. * Fix zero initialization of the other fields. * Change the use of vector for stack allocations. | |||
2023-03-11 | Add AVX2 support for x86 architectures thanks to @Const-me ! | Georgi Gerganov | |
2023-03-11 | Support all LLaMA models + change Q4_0 quantization storage | Georgi Gerganov | |
2023-03-10 | Initial release | Georgi Gerganov | |