Age | Commit message | Author | |
---|---|---|---|
2023-03-24 | Add embedding mode with arg flag. Currently working (#282) | Luciano | |
* working but ugly * add arg flag, not working on embedding mode * typo * Working! Thanks to @nullhook * make params argument instead of hardcoded boolean. remove useless time check * start doing the instructions but not finished. This probably doesnt compile * Embeddings extraction support --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> | |||
2023-03-22 | Introduce C-style API (#370) | Georgi Gerganov | |
* Major refactoring - introduce C-style API * Clean up * Add <cassert> * Add <iterator> * Add <algorithm> .... * Fix timing reporting and accumulation * Measure eval time only for single-token calls * Change llama_tokenize return meaning |