diff options
author | Georgi Gerganov <ggerganov@gmail.com> | 2023-03-10 23:53:11 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-03-10 23:53:11 +0200 |
commit | 8a01f565ff78cc6c0c5a9fa402787a2f179f2d78 (patch) | |
tree | 03654106137fbe2f3e315ee765df717605eae916 /README.md | |
parent | 70bc0b8b15b98dca23b28f0c8f5e34b27e424cda (diff) |
Update README.md
Diffstat (limited to 'README.md')
-rw-r--r-- | README.md | 3 |
1 files changed, 3 insertions, 0 deletions
@@ -2,6 +2,9 @@ Inference of [Facebook's LLaMA](https://github.com/facebookresearch/llama) model in pure C/C++ +**TEMPORARY NOTICE:** +If you observe garbage results, make sure to update to latest master. There was a bug and it was fixed here: https://github.com/ggerganov/llama.cpp/commit/70bc0b8b15b98dca23b28f0c8f5e34b27e424cda + ## Description The main goal is to run the model using 4-bit quantization on a MacBook. |