Age | Commit message (Collapse) | Author |
|
* working but ugly
* add arg flag, not working on embedding mode
* typo
* Working! Thanks to @nullhook
* make params argument instead of hardcoded boolean. remove useless time check
* start doing the instructions but not finished. This probably doesnt compile
* Embeddings extraction support
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
|
This reverts commit 4870e455b3653f7d7769fa5772b2c90ffad088df.
Will provide the correct fix later
|
|
|
|
Should make results reproducible for different number of threads and batch sizes
|
|
fixed: memcpy is not defined
|
|
|
|
* Major refactoring - introduce C-style API
* Clean up
* Add <cassert>
* Add <iterator>
* Add <algorithm> ....
* Fix timing reporting and accumulation
* Measure eval time only for single-token calls
* Change llama_tokenize return meaning
|