llama.cpp.git - llama.cpp

Age	Commit message (Expand)	Author
2023-03-28	deploy : add a Package.swift for SwiftPM support (#393)	Jed Fox
2023-03-28	ggml : introduce structs for the q4 data blocks (#356)	Stephan Walter
2023-03-28	gitignore : add "embedding"	Georgi Gerganov
2023-03-28	Check the existence of f16_model_path_base in quantize.py (#574)	dotpy314
2023-03-28	Fix usage of F16C intrinsics in AVX code (#563)	slaren
2023-03-28	main.cpp fixes, refactoring (#571)	anzz1
2023-03-28	Add embedding example to Makefile (#540)	RJ Adriaansen
2023-03-27	Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542)	Marco Matthies
2023-03-26	ci: add debug build to sanitizer build matrix (#527)	Erik Scholz
2023-03-26	Fix undefined variables in debug build, remove unused variables (#531)	Stephan Walter
2023-03-26	Add support for linux/arm64 platform during Docker Builds (#514)	Juan Calderon-Perez
2023-03-26	Update README and comments for standalone perplexity tool (#525)	Stephan Walter
2023-03-26	[main] fix infinite generation (-n == -1) (#523)	anzz1
2023-03-26	Add logo to README.md	Georgi Gerganov
2023-03-26	Exit from interactive mode if input stream is bad (#491)	Harald Fernengel
2023-03-26	CI: Run other sanitizer builds even if one fails (#511)	anzz1
2023-03-25	Clarify console output in convert-pth-to-ggml.py (#512)	jp-x-g
2023-03-25	CMake / CI additions (#497)	anzz1
2023-03-25	(Windows) Set console to UTF-8 on init (#420)	anzz1
2023-03-25	Fix colors enabling on WIN32	Georgi Gerganov
2023-03-25	If n_predict == -1, generate forever	Georgi Gerganov
2023-03-25	Inifinite generation via context swapping (#71)	Georgi Gerganov
2023-03-25	Cleanup STL headers + fix embedding examples + minor stuff	Georgi Gerganov
2023-03-25	Move chat scripts into "./examples"	Georgi Gerganov
2023-03-25	Add AVX2 implementation of dequantize_row_q4_1 (#505)	slaren
2023-03-25	Overhaul the examples structure	Georgi Gerganov
2023-03-25	Retire the ggml_mul_mat() branch for transposed src0 (#500)	Georgi Gerganov
2023-03-25	Disable prompt verbosity by default and add option to enable (#480)	Georgi Gerganov
2023-03-25	Add AVX2 implementation of dequantize_row_q4_0 (#467)	slaren
2023-03-25	Don't interefe with BLAS for large prompts by running only 1 thread	Georgi Gerganov
2023-03-25	Add longer DAN prompt for testing big batch numbers	Georgi Gerganov
2023-03-25	Add timings for the prompt evaluation (#478)	slaren
2023-03-25	Remove obsolete information from README	Georgi Gerganov
2023-03-25	Remove obsolete assert and fix compiler warning	Georgi Gerganov
2023-03-25	Fix nasty bug in ggml_compute_forward_mul_mat_f32() and reenable BLAS	Georgi Gerganov
2023-03-25	bounds checking for input prefix (#492)	anzz1
2023-03-25	feat: '--in-prefix STRING' option (#426)	anzz1
2023-03-25	Add support for file load progress reporting callbacks (#434)	Jed Fox
2023-03-25	Add missing struct annotation (#483)	Doomsdayrs
2023-03-25	Fix crash for 65B model with pre-allocated memory (#485)	Chris Kuehl
2023-03-24	Disable BLAS altogether - the bug is not just for qunatized mat mul	Georgi Gerganov
2023-03-24	Disable BLAS branch in mul_mat - seems there is a bug	Georgi Gerganov
2023-03-24	Immediately start processing the prompt before user input has been provided (...	Georgi Gerganov
2023-03-24	Reduce memory usage and allocate enough memory for largest context (#473)	Georgi Gerganov
2023-03-24	Temporary bump the memory buffer size - hopefully fix issues from 483bab2e	Georgi Gerganov
2023-03-24	Update README.md (#444)	Gary Mulder
2023-03-24	fix instruct mode (#445)	rabidcopy
2023-03-24	Properly free llama_context on failure	Georgi Gerganov
2023-03-24	additional optimizations for POWER9 (#454)	Cameron Kaiser
2023-03-24	Support calling mlock() on loaded model data on Linux and macOS (#453)	comex