aboutsummaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2023-03-21Update IPFS links to quantized alpaca with new tokenizer format (#352)Kevin Kwok
2023-03-21Change default repeat_penalty to 1.0Georgi Gerganov
2023-03-21Add tokenizer test + revert to C++11 (#355)Georgi Gerganov
2023-03-21Add initial AVX512 support for dot product on Linux (#320)Casey Primozic
2023-03-21Adding missing features of CMakeLists.txt & Refactoring (#131)nusu-github
2023-03-20Nix flake: set meta.mainProgram to llamaBen Siraphob
2023-03-20Fixed tokenizer.model not found error when model dir is symlink (#325)Qingyou Meng
2023-03-20move file magic/version to header, print expected version (#319)Mack Straight
2023-03-20Docker - Fix publish docker image in GitHub Registry (#235)Bernat Vadell
2023-03-20sentencepiece bpe compatible tokenizer (#252)Mack Straight
2023-03-20Add tqdm to Python requirements (#293)Stephan Walter
2023-03-19bugfix: default should not be interactive (#304)cocktailpeanut
2023-03-19Rename scriptGeorgi Gerganov
2023-03-19Add temporary helper script for Alpaca chatGeorgi Gerganov
2023-03-19fix coloring of last `n_batch` of prompt, and refactor line input (#221)Rickey Bowers Jr
2023-03-19Support for multiple reverse prompts. (#299)tjohnman
2023-03-19Improved quantize script (#222)Suaj Carrot
2023-03-19Make prompt randomization optional. (#300)tjohnman
2023-03-19Respect the maximum number of tokens in interactive. (#298)tjohnman
2023-03-19Add --ignore-eos parameter (#181)slaren
2023-03-19interactive mode: print '\n' in sigint_handler, this flush stdout thus ensure...Qingyou Meng
2023-03-19Command line switch to use F16 for memory_k and memory_v (refactor of #154) (...Erik Scholz
2023-03-19Update hot topics to mention Alpaca supportGeorgi Gerganov
2023-03-19Fix off-by-one bug (#115)Georgi Gerganov
2023-03-19Fix python stuff (#109)Georgi Gerganov
2023-03-19Refactoring `convert-pth-to-ggml.py`: more concise and readable (#109)qunash
2023-03-19Drop trailing new line from file prompts (#80)Georgi Gerganov
2023-03-19Add instruction for using Alpaca (#240)Georgi Gerganov
2023-03-19Add "--instruct" argument for usage with Alpaca (#240)Georgi Gerganov
2023-03-19Change RMSNorm eps to 1e-6 (#173)Georgi Gerganov
2023-03-18Warn user if a context size greater than 2048 tokens is specified (#274)Ronsor
2023-03-18Fix typo in readmePavol Rusnak
2023-03-18Add note about Python 3.11 to readmePavol Rusnak
2023-03-18Add memory/disk requirements to readmePavol Rusnak
2023-03-18Remove unused code since n_vocab is model.hparams.n_vocab (#262)Alex Nguyen
2023-03-18fixed warning with std::ignore about unused function result (#151)Justin Suess
2023-03-18Fix n^2 loop in tokenization (#254)Gary Linscott
2023-03-18CI Improvements (#230)anzz1
2023-03-17Nix flake (#40)Niklas Korz
2023-03-17Implement non-greedy tokenizer that tries to maximize token lengths (#242)thement
2023-03-17Default to 4 threads (#243)Georgi Gerganov
2023-03-17Update Contributing sectionGeorgi Gerganov
2023-03-17Don't tell users to use a bad number of threads (#243)Stephan Walter
2023-03-17add ptread link to fix cmake build under linux (#114)mmyjona
2023-03-17🚀 Dockerize llamacpp (#132)Bernat Vadell
2023-03-17Q4_1 quantization (#193)Matvey Soloviev
2023-03-16Update README.mdGeorgi Gerganov
2023-03-16Expand "Contributing" sectionGeorgi Gerganov
2023-03-16Update hot topics - RMSnormGeorgi Gerganov
2023-03-15Fix RMS norm in GGML (#191)Nebula