llama.cpp.git - llama.cpp

Age	Commit message	Author
2023-04-14	ggml : always allocate buffers with size multiple of GGML_MEM_ALIGN	Georgi Gerganov
2023-04-14	ggml : fix q4_1 dot product types	Georgi Gerganov
2023-04-14	ggml : optimize rope function to avoid call powf in the tight loop (#807)	Howard Su
2023-04-13	ggml : add GGML_DEFAULT_N_THREADS	Georgi Gerganov
2023-04-13	ggml : speed-up ggml_vec_dot_q4_1() ARM_NEON + 32-bit ARM support (#900)	Georgi Gerganov
2023-04-13	ggml : optimize non-SIMD Q4_0 vector dot product (#703)	Stephan Walter
2023-04-13	ggml : introduce GGML_ALIGNED_MALLOC/GGML_ALIGNED_FREE macros (#884)	Pavol Rusnak
2023-04-13	ggml : update cblas_sgemm columns var to be more reasonable (#838)	Vladimir
2023-04-11	Fix whitespace, add .editorconfig, add GitHub workflow (#883)	Pavol Rusnak
2023-04-11	Add enum llama_ftype, sync ggml_type to model files (#709)	Stephan Walter
2023-04-11	Windows fixes (#890)	comex
2023-04-10	ggml : fix WASM build	Georgi Gerganov
2023-04-10	ggml : add ggml_cont() + optimize ggml_cpy() for contiguous dst	Georgi Gerganov
2023-04-10	ggml : remove trailing whitespaces	Georgi Gerganov
2023-04-10	Simplify to include lower-case windows.h always, fix compile on mingw32 (#747)	Marco Matthies
2023-04-10	ggml : fix quantize_row_q4_1() ARM_NEON (close #876)	Georgi Gerganov
2023-04-10	Rewrite loading code to try to satisfy everyone:	comex
2023-04-08	Add quantize-stats command for testing quantization (#728)	unbounded
2023-04-05	ggml : multi-thread ggml_rope() (~3-4 times faster on M1) (#781)	Georgi Gerganov
2023-04-05	ggml, llama : avoid heavy V transpose + improvements (#775)	Georgi Gerganov
2023-04-03	10+% performance improvement of ggml_vec_dot_q4_0 on AVX2 (#654)	SebastianApel
2023-04-02	ggml : change ne to int64_t (#626)	Marian Cepok
2023-03-31	Enable -std= for cmake builds, fix warnings (#598)	Stephan Walter
2023-03-31	Optimize AVX2 ggml_vec_dot_q4_0 (#642)	slaren
2023-03-31	Add AVX acceleration (#617)	perserk
2023-03-30	Ensure --mlock works properly with mmap() support	Justine Tunney
2023-03-30	Add mmap support for model files	Slaren
2023-03-30	Remove unused variable (#607)	Casey Primozic
2023-03-30	ggml : fix NEON signs (close #620, #622)	Georgi Gerganov
2023-03-30	Fix GGML_F32Cx8_STORE in AVX without F16C path (#619)	slaren
2023-03-29	ggml : init time on first ggml_init() call	Georgi Gerganov
2023-03-29	ggml : add ARM_NEON dequantize_row_q4_1()	Georgi Gerganov
2023-03-29	ggml : add ARM_NEON quantize_row_q4_1()	Georgi Gerganov
2023-03-29	ggml : add ARM_NEON ggml_vec_dot_q4_1()	Georgi Gerganov
2023-03-29	Fix GCC warning about binary literal (#595)	anzz1
2023-03-28	Enable Fused-Multiply-Add (FMA) and F16C/CVT16 vector extensions on MSVC (#375)	anzz1
2023-03-28	ggml : add AVX2 implementation of quantize_row_q4_1 (#515)	slaren
2023-03-28	ggml : refactor quantized processing functions (#509)	Stephan Walter
2023-03-28	all : be more strict about converting float to double (#458)	Stephan Walter
2023-03-28	ggml : introduce structs for the q4 data blocks (#356)	Stephan Walter
2023-03-28	Fix usage of F16C intrinsics in AVX code (#563)	slaren
2023-03-26	Fix undefined variables in debug build, remove unused variables (#531)	Stephan Walter
2023-03-25	Add AVX2 implementation of dequantize_row_q4_1 (#505)	slaren
2023-03-25	Overhaul the examples structure	Georgi Gerganov
2023-03-25	Retire the ggml_mul_mat() branch for transposed src0 (#500)	Georgi Gerganov
2023-03-25	Add AVX2 implementation of dequantize_row_q4_0 (#467)	slaren
2023-03-25	Remove obsolete assert and fix compiler warning	Georgi Gerganov
2023-03-25	Fix nasty bug in ggml_compute_forward_mul_mat_f32() and reenable BLAS	Georgi Gerganov
2023-03-24	Disable BLAS altogether - the bug is not just for quantized mat mul	Georgi Gerganov
2023-03-24	Disable BLAS branch in mul_mat - seems there is a bug	Georgi Gerganov