aboutsummaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2023-03-24Disable BLAS altogether - the bug is not just for qunatized mat mulGeorgi Gerganov
2023-03-24Disable BLAS branch in mul_mat - seems there is a bugGeorgi Gerganov
2023-03-24Immediately start processing the prompt before user input has been provided (...Georgi Gerganov
2023-03-24Reduce memory usage and allocate enough memory for largest context (#473)Georgi Gerganov
2023-03-24Temporary bump the memory buffer size - hopefully fix issues from 483bab2eGeorgi Gerganov
2023-03-24Update README.md (#444)Gary Mulder
2023-03-24fix instruct mode (#445)rabidcopy
2023-03-24Properly free llama_context on failureGeorgi Gerganov
2023-03-24additional optimizations for POWER9 (#454)Cameron Kaiser
2023-03-24Support calling mlock() on loaded model data on Linux and macOS (#453)comex
2023-03-24Add embedding mode with arg flag. Currently working (#282)Luciano
2023-03-24Add link to Roadmap discussionGeorgi Gerganov
2023-03-24Revert "Fix memory allocation issues and seg faults"Georgi Gerganov
2023-03-24Fix memory allocation issues and seg faultsGeorgi Gerganov
2023-03-23Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439)Georgi Gerganov
2023-03-23Fix quantize script not finding models in parent directory (#428)Jed Fox
2023-03-23Remove oboslete command from Docker scriptGeorgi Gerganov
2023-03-23ObsoleteGeorgi Gerganov
2023-03-23Replace EOS with newline to prevent context/memory being flushed by EOS in in...rabidcopy
2023-03-23Fix GPTQ converter (#423)Timmy Knight
2023-03-23Generate library with CMake (#430)nusu-github
2023-03-23Command line args bounds checking (#424)anzz1
2023-03-23Fix Nix buildBen Siraphob
2023-03-23Revert "Delete SHA256SUMS for now" (#429)Stephan Walter
2023-03-23Fix Makefile echo escape codes (by removing them). (#418)Kerfuffle
2023-03-23Move model section from issue template to README.md (#421)Gary Mulder
2023-03-23Delete SHA256SUMS for now (#416)anzz1
2023-03-23Adjust repetition penalty ..Georgi Gerganov
2023-03-23Add link to recent podcast about whisper.cpp and llama.cppGeorgi Gerganov
2023-03-23CI: CMake: Separate build and test steps (#376)anzz1
2023-03-23Fix instruct mode broken by PR #354 (#409)tjohnman
2023-03-22Update issue template so people will use it (#404)Gary Mulder
2023-03-22Deduplicate q4 quantization functions (#383)Stephan Walter
2023-03-22fix: add POSIX functionality for Linux compilation (#51)Valentyn Bezshapkin
2023-03-22Don't force immediate interactive without `-i` (#354)tjohnman
2023-03-22cmake: make llama an actual library (#392)Erik Scholz
2023-03-22fix perplexity after c-api refactor (#390)Erik Scholz
2023-03-22Add details on perplexity to README.md (#395)Gary Linscott
2023-03-22Add missing header for memcpy (#386)Yusuf Kağan Hanoğlu
2023-03-22When seed <= 0 - use the clock to generate oneGeorgi Gerganov
2023-03-22Init llama_context_params properly from CLI (#370)Georgi Gerganov
2023-03-22Remove temporary notice and update hot topicsGeorgi Gerganov
2023-03-22Introduce C-style API (#370)Georgi Gerganov
2023-03-21Add SHA256SUMS file and instructions to README how to obtain and verify the d...Gary Mulder
2023-03-22Fix bin dir for win cianzz1
2023-03-21specify build type for ctest on windows (#371)Erik Scholz
2023-03-21Add notice about pending changeGeorgi Gerganov
2023-03-21fix typo in chatLLaMa (#368)Mathieu Nayrolles
2023-03-21Update issue templatesGeorgi Gerganov
2023-03-21We could use std::unordered_map over std::map (#305)Fabio R. Sluzala