Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-03-22 | Introduce C-style API (#370) | Georgi Gerganov | |
* Major refactoring - introduce C-style API * Clean up * Add <cassert> * Add <iterator> * Add <algorithm> .... * Fix timing reporting and accumulation * Measure eval time only for single-token calls * Change llama_tokenize return meaning | |||
2023-03-21 | Add tokenizer test + revert to C++11 (#355) | Georgi Gerganov | |
* Add test-tokenizer-0 to do a few tokenizations - feel free to expand * Added option to convert-pth-to-ggml.py script to dump just the vocabulary * Added ./models/ggml-vocab.bin containing just LLaMA vocab data (used for tests) * Added utility to load vocabulary file from previous point (temporary implementation) * Avoid using std::string_view and drop back to C++11 (hope I didn't break something) * Rename gpt_vocab -> llama_vocab * All CMake binaries go into ./bin/ now | |||
2023-03-14 | Add section to README on how to run the project on Android (#130) | Radoslav Gerganov | |
2023-03-10 | Final touches | Georgi Gerganov | |