ollama/gpu
Daniel Hiltgen 9929751cc8 Disable concurrency for AMD + Windows
Until ROCm v6.2 ships, we wont be able to get accurate free memory
reporting on windows, which makes automatic concurrency too risky.
Users can still opt-in but will need to pay attention to model sizes otherwise they may thrash/page VRAM or cause OOM crashes.
All other platforms and GPUs have accurate VRAM reporting wired
up now, so we can turn on concurrency by default.
2024-06-21 15:45:05 -07:00
..
amd_common.go Support Fedoras standard ROCm location 2024-05-01 15:47:12 -07:00
amd_hip_windows.go Record more GPU information 2024-05-09 14:18:14 -07:00
amd_linux.go Merge pull request #4875 from dhiltgen/rocm_gfx900_workaround 2024-06-15 07:38:58 -07:00
amd_windows.go Disable concurrency for AMD + Windows 2024-06-21 15:45:05 -07:00
assets.go err!=nil check 2024-06-20 09:30:59 -07:00
cpu_common.go review comments and coverage 2024-06-14 14:55:50 -07:00
cuda_common.go lint linux 2024-06-04 11:13:30 -07:00
gpu.go Revert "Revert "gpu: add env var for detecting Intel oneapi gpus (#5076)"" 2024-06-19 08:57:41 -07:00
gpu_darwin.go review comments and coverage 2024-06-14 14:55:50 -07:00
gpu_info.h Reintroduce nvidia nvml library for windows 2024-06-14 14:51:40 -07:00
gpu_info_cudart.c Fix bad symbol load detection 2024-06-19 08:39:07 -07:00
gpu_info_cudart.h Refine GPU discovery to bootstrap once 2024-06-14 14:51:40 -07:00
gpu_info_darwin.h darwin: no partial offloading if required memory greater than system 2024-04-16 11:22:38 -07:00
gpu_info_darwin.m darwin: no partial offloading if required memory greater than system 2024-04-16 11:22:38 -07:00
gpu_info_nvcuda.c Fix bad symbol load detection 2024-06-19 08:39:07 -07:00
gpu_info_nvcuda.h Reintroduce nvidia nvml library for windows 2024-06-14 14:51:40 -07:00
gpu_info_nvml.c Fix bad symbol load detection 2024-06-19 08:39:07 -07:00
gpu_info_nvml.h Reintroduce nvidia nvml library for windows 2024-06-14 14:51:40 -07:00
gpu_info_oneapi.c get real func ptr. 2024-06-19 09:00:51 +08:00
gpu_info_oneapi.h review comments and coverage 2024-06-14 14:55:50 -07:00
gpu_linux.go review comments and coverage 2024-06-14 14:55:50 -07:00
gpu_oneapi.go support ollama run on Intel GPUs 2024-05-24 11:18:27 +08:00
gpu_test.go lint 2024-06-04 11:13:30 -07:00
gpu_windows.go review comments and coverage 2024-06-14 14:55:50 -07:00
types.go Disable concurrency for AMD + Windows 2024-06-21 15:45:05 -07:00