aboutsummaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2023-03-20Add tqdm to Python requirements (#293)Stephan Walter
2023-03-19bugfix: default should not be interactive (#304)cocktailpeanut
2023-03-19Rename scriptGeorgi Gerganov
2023-03-19Add temporary helper script for Alpaca chatGeorgi Gerganov
2023-03-19fix coloring of last `n_batch` of prompt, and refactor line input (#221)Rickey Bowers Jr
2023-03-19Support for multiple reverse prompts. (#299)tjohnman
2023-03-19Improved quantize script (#222)Suaj Carrot
2023-03-19Make prompt randomization optional. (#300)tjohnman
2023-03-19Respect the maximum number of tokens in interactive. (#298)tjohnman
2023-03-19Add --ignore-eos parameter (#181)slaren
2023-03-19interactive mode: print '\n' in sigint_handler, this flush stdout thus ensure...Qingyou Meng
2023-03-19Command line switch to use F16 for memory_k and memory_v (refactor of #154) (...Erik Scholz
2023-03-19Update hot topics to mention Alpaca supportGeorgi Gerganov
2023-03-19Fix off-by-one bug (#115)Georgi Gerganov
2023-03-19Fix python stuff (#109)Georgi Gerganov
2023-03-19Refactoring `convert-pth-to-ggml.py`: more concise and readable (#109)qunash
2023-03-19Drop trailing new line from file prompts (#80)Georgi Gerganov
2023-03-19Add instruction for using Alpaca (#240)Georgi Gerganov
2023-03-19Add "--instruct" argument for usage with Alpaca (#240)Georgi Gerganov
2023-03-19Change RMSNorm eps to 1e-6 (#173)Georgi Gerganov
2023-03-18Warn user if a context size greater than 2048 tokens is specified (#274)Ronsor
2023-03-18Fix typo in readmePavol Rusnak
2023-03-18Add note about Python 3.11 to readmePavol Rusnak
2023-03-18Add memory/disk requirements to readmePavol Rusnak
2023-03-18Remove unused code since n_vocab is model.hparams.n_vocab (#262)Alex Nguyen
2023-03-18fixed warning with std::ignore about unused function result (#151)Justin Suess
2023-03-18Fix n^2 loop in tokenization (#254)Gary Linscott
2023-03-18CI Improvements (#230)anzz1
2023-03-17Nix flake (#40)Niklas Korz
2023-03-17Implement non-greedy tokenizer that tries to maximize token lengths (#242)thement
2023-03-17Default to 4 threads (#243)Georgi Gerganov
2023-03-17Update Contributing sectionGeorgi Gerganov
2023-03-17Don't tell users to use a bad number of threads (#243)Stephan Walter
2023-03-17add ptread link to fix cmake build under linux (#114)mmyjona
2023-03-17🚀 Dockerize llamacpp (#132)Bernat Vadell
2023-03-17Q4_1 quantization (#193)Matvey Soloviev
2023-03-16Update README.mdGeorgi Gerganov
2023-03-16Expand "Contributing" sectionGeorgi Gerganov
2023-03-16Update hot topics - RMSnormGeorgi Gerganov
2023-03-15Fix RMS norm in GGML (#191)Nebula
2023-03-16Add RMS norm and use it (#187)hoangmit
2023-03-15fixed typo (#178)moritzbrantner
2023-03-15add SIGINT support for _WIN32 environments (#120)Rickey Bowers Jr
2023-03-15added ctx_size parameter (#148)Justin Suess
2023-03-15fixed color reset on exit (#149)Justin Suess
2023-03-15Fix potential licensing issue (#126)Musab Gultekin
2023-03-15Use `tokenizer.vocab_size()` instead of hardcoding 32000 in convert-pth-to-gg...Ronsor
2023-03-15inline -> static inline for "bytesFromNibbles" (#161)hoangmit
2023-03-14Don't use vdotq_s32 if it's not available (#139)Ronsor
2023-03-14Add section to README on how to run the project on Android (#130)Radoslav Gerganov