Age  Commit message  Author
2023-03-19  Refactoring `convert-pth-to-ggml.py`: more concise and readable (#109)  qunash
2023-03-19  Drop trailing new line from file prompts (#80)  Georgi Gerganov
2023-03-19  Add instruction for using Alpaca (#240)  Georgi Gerganov
2023-03-19  Add "--instruct" argument for usage with Alpaca (#240)  Georgi Gerganov
2023-03-19  Change RMSNorm eps to 1e-6 (#173)  Georgi Gerganov
2023-03-18  Warn user if a context size greater than 2048 tokens is specified (#274)  Ronsor
2023-03-18  Fix typo in readme  Pavol Rusnak
2023-03-18  Add note about Python 3.11 to readme  Pavol Rusnak
2023-03-18  Add memory/disk requirements to readme  Pavol Rusnak
2023-03-18  Remove unused code since n_vocab is model.hparams.n_vocab (#262)  Alex Nguyen
2023-03-18  fixed warning with std::ignore about unused function result (#151)  Justin Suess
2023-03-18  Fix n^2 loop in tokenization (#254)  Gary Linscott
2023-03-18  CI Improvements (#230)  anzz1
2023-03-17  Nix flake (#40)  Niklas Korz
2023-03-17  Implement non-greedy tokenizer that tries to maximize token lengths (#242)  thement
2023-03-17  Default to 4 threads (#243)  Georgi Gerganov
2023-03-17  Update Contributing section  Georgi Gerganov
2023-03-17  Don't tell users to use a bad number of threads (#243)  Stephan Walter
2023-03-17  add ptread link to fix cmake build under linux (#114)  mmyjona
2023-03-17  🚀 Dockerize llamacpp (#132)  Bernat Vadell
2023-03-17  Q4_1 quantization (#193)  Matvey Soloviev
2023-03-16  Update README.md  Georgi Gerganov
2023-03-16  Expand "Contributing" section  Georgi Gerganov
2023-03-16  Update hot topics - RMSnorm  Georgi Gerganov
2023-03-15  Fix RMS norm in GGML (#191)  Nebula
2023-03-16  Add RMS norm and use it (#187)  hoangmit
2023-03-15  fixed typo (#178)  moritzbrantner
2023-03-15  add SIGINT support for _WIN32 environments (#120)  Rickey Bowers Jr
2023-03-15  added ctx_size parameter (#148)  Justin Suess
2023-03-15  fixed color reset on exit (#149)  Justin Suess
2023-03-15  Fix potential licensing issue (#126)  Musab Gultekin
2023-03-15  Use `tokenizer.vocab_size()` instead of hardcoding 32000 in convert-pth-to-gg...  Ronsor
2023-03-15  inline -> static inline for "bytesFromNibbles" (#161)  hoangmit
2023-03-14  Don't use vdotq_s32 if it's not available (#139)  Ronsor
2023-03-14  Add section to README on how to run the project on Android (#130)  Radoslav Gerganov
2023-03-14  Add Misc section + update hot topics + minor fixes  Georgi Gerganov
2023-03-13  Add windows to the CI (#98)  Sebastián A
2023-03-13  CMake build in Release by default (#75)  Georgi Gerganov
2023-03-13  Update contribution section, hot topics, limitations, etc.  Georgi Gerganov
2023-03-13  Print system information  Georgi Gerganov
2023-03-13  Initial support for CMake (#75)  Sebastián A
2023-03-13  Add NetBSD support. (#90)  Thomas Klausner
2023-03-13  Use fprintf for diagnostic output (#48)  Pavol Rusnak
2023-03-13  Use vdotq_s32 to improve performance (#67)  Georgi Gerganov
2023-03-13  Reduce model loading time (#43)  uint256_t
2023-03-13  Fix UTF-8 handling (including colors) (#79)  Val Kharitonov
2023-03-13  Add quantize script for batch quantization (#92)  Pavol Rusnak
2023-03-13  Add initial contribution guidelines  Georgi Gerganov
2023-03-13  Gate signal support on being on a unixoid system. (#74)  Matvey Soloviev
2023-03-13  Fix token count accounting  Matvey Soloviev