aboutsummaryrefslogtreecommitdiff
path: root/examples/main/README.md
diff options
context:
space:
mode:
Diffstat (limited to 'examples/main/README.md')
-rw-r--r--examples/main/README.md2
1 files changed, 1 insertions, 1 deletions
diff --git a/examples/main/README.md b/examples/main/README.md
index 7c03f92..e71ba61 100644
--- a/examples/main/README.md
+++ b/examples/main/README.md
@@ -272,7 +272,7 @@ These options help improve the performance and memory usage of the LLaMA models.
### Prompt Caching
-- `--prompt-cache FNAME`: Specify a file to cache the model state after the initial prompt. This can significantly speed up the startup time when you're using longer prompts. The file is created during the first run and is reused and updated in subsequent runs.
+- `--prompt-cache FNAME`: Specify a file to cache the model state after the initial prompt. This can significantly speed up the startup time when you're using longer prompts. The file is created during the first run and is reused and updated in subsequent runs. **Note**: Restoring a cached prompt does not imply restoring the exact state of the session at the point it was saved. So even when specifying a specific seed, you are not guaranteed to get the same sequence of tokens as the original generation.
### Quantization