add OLLAMA_KEEP_ALIVE env variable to FAQ (#3865)

This commit is contained in:
Patrick Devine 2024-04-23 21:06:51 -07:00 committed by GitHub
parent 14476d48cc
commit 74d2a9ef9a
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -228,3 +228,7 @@ To unload the model and free up memory use:
```shell
curl http://localhost:11434/api/generate -d '{"model": "llama2", "keep_alive": 0}'
```
Alternatively, you can change the amount of time all models are loaded into memory by setting the `OLLAMA_KEEP_ALIVE` environment variable when starting the Ollama server. The `OLLAMA_KEEP_ALIVE` variable uses the same parameter types as the `keep_alive` parameter types mentioned above. Refer to section explaining [how to configure the Ollama server](#how-do-i-configure-ollama-server) to correctly set the environment variable.
If you wish to override the `OLLAMA_KEEP_ALIVE` setting, use the `keep_alive` API parameter with the `/api/generate` or `/api/chat` API endpoints.