Add docs explaining GPU selection env vars
parent b5fcd9d3aa
commit b53229a2ed
1 changed file with 10 additions and 0 deletions
docs/faq.md
@@ -193,3 +193,13 @@ To unload the model and free up memory use:

```shell
curl http://localhost:11434/api/generate -d '{"model": "llama2", "keep_alive": 0}'
```

## Controlling which GPUs to use

By default, on Linux and Windows, Ollama will attempt to use NVIDIA or Radeon
GPUs and will use all of the GPUs it can find. You can limit which GPUs are
used by setting the environment variable `CUDA_VISIBLE_DEVICES` for NVIDIA
cards, or `HIP_VISIBLE_DEVICES` for Radeon GPUs, to a comma-delimited list of
GPU IDs. You can see the list of devices with GPU tools such as `nvidia-smi`
or `rocminfo`. Setting the variable to an invalid GPU ID (e.g., "-1") bypasses
the GPUs and falls back to the CPU.
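
For example, you can set the variable when starting the server. This is a
minimal sketch; the GPU IDs below are placeholders, so list yours first with
`nvidia-smi` or `rocminfo`:

```shell
# NVIDIA: only expose GPUs 0 and 2 to Ollama (example IDs; check with nvidia-smi)
CUDA_VISIBLE_DEVICES=0,2 ollama serve

# Radeon: only expose GPU 0 (check device IDs with rocminfo)
HIP_VISIBLE_DEVICES=0 ollama serve

# Bypass the GPUs entirely and fall back to the CPU via an invalid ID
CUDA_VISIBLE_DEVICES=-1 ollama serve
```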