aboutsummaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2023-03-17🚀 Dockerize llamacpp (#132)Bernat Vadell
2023-03-17Q4_1 quantization (#193)Matvey Soloviev
2023-03-16Update README.mdGeorgi Gerganov
2023-03-16Expand "Contributing" sectionGeorgi Gerganov
2023-03-16Update hot topics - RMSnormGeorgi Gerganov
2023-03-15Fix RMS norm in GGML (#191)Nebula
2023-03-16Add RMS norm and use it (#187)hoangmit
2023-03-15fixed typo (#178)moritzbrantner
2023-03-15add SIGINT support for _WIN32 environments (#120)Rickey Bowers Jr
2023-03-15added ctx_size parameter (#148)Justin Suess
2023-03-15fixed color reset on exit (#149)Justin Suess
2023-03-15Fix potential licensing issue (#126)Musab Gultekin
2023-03-15Use `tokenizer.vocab_size()` instead of hardcoding 32000 in convert-pth-to-gg...Ronsor
2023-03-15inline -> static inline for "bytesFromNibbles" (#161)hoangmit
2023-03-14Don't use vdotq_s32 if it's not available (#139)Ronsor
2023-03-14Add section to README on how to run the project on Android (#130)Radoslav Gerganov
2023-03-14Add Misc section + update hot topics + minor fixesGeorgi Gerganov
2023-03-13Add windows to the CI (#98)Sebastián A
2023-03-13CMake build in Release by default (#75)Georgi Gerganov
2023-03-13Update contribution section, hot topics, limitations, etc.Georgi Gerganov
2023-03-13Print system informationGeorgi Gerganov
2023-03-13Initial support for CMake (#75)Sebastián A
2023-03-13Add NetBSD support. (#90)Thomas Klausner
2023-03-13Use fprintf for diagnostic output (#48)Pavol Rusnak
2023-03-13Use vdotq_s32 to improve performance (#67)Georgi Gerganov
2023-03-13Reduce model loading time (#43)uint256_t
2023-03-13Fix UTF-8 handling (including colors) (#79)Val Kharitonov
2023-03-13Add quantize script for batch quantization (#92)Pavol Rusnak
2023-03-13Add initial contribution guidelinesGeorgi Gerganov
2023-03-13Gate signal support on being on a unixoid system. (#74)Matvey Soloviev
2023-03-13Fix token count accountingMatvey Soloviev
2023-03-13Revert "10% performance boost on ARM"Georgi Gerganov
2023-03-13Check for vdotq_s32 availabilityGeorgi Gerganov
2023-03-13Ammend to previous commit - forgot to update non-QRDMX branchGeorgi Gerganov
2023-03-1310% performance boost on ARMGeorgi Gerganov
2023-03-13Fix color getting reset before prompt output done (#65)Matvey Soloviev
2023-03-12Update README.mdGeorgi Gerganov
2023-03-12Add interactive mode (#61)Matvey Soloviev
2023-03-12Fix typo in README (#45)Marc Köhlbrugge
2023-03-12Allow using prompt files (#59)Ben Garney
2023-03-12Add back top_k (#56)beiller
2023-03-12Windows fixes (#31)Sebastián A
2023-03-12Update README.mdGeorgi Gerganov
2023-03-12Add CI (#60)Georgi Gerganov
2023-03-12Revert "weights_only" arg - this causing more trouble than helpGeorgi Gerganov
2023-03-12python/pytorch compat notes (#44)Oleksandr Nikitin
2023-03-12Add repetition penalty (#20)beiller
2023-03-12Clarify meaning of hackingGeorgi Gerganov
2023-03-12README: add "Supported platforms" + update hot topicsGeorgi Gerganov
2023-03-12use weights_only in conversion script (#32)deepdiffuser