ollama/gpu
Daniel Hiltgen b08870aff3
Merge pull request #4188 from dhiltgen/use_our_lib
User our bundled libraries (cuda) instead of the host library
2024-05-06 14:41:05 -07:00
..
amd_common.go Support Fedoras standard ROCm location 2024-05-01 15:47:12 -07:00
amd_hip_windows.go Request and model concurrency 2024-04-22 19:29:12 -07:00
amd_linux.go Support Fedoras standard ROCm location 2024-05-01 15:47:12 -07:00
amd_windows.go Support Fedoras standard ROCm location 2024-05-01 15:47:12 -07:00
assets.go Centralize server config handling 2024-05-05 16:49:50 -07:00
cpu_common.go Mechanical switch from log to slog 2024-01-18 14:12:57 -08:00
cuda_common.go Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu.go Use our libraries first 2024-05-06 14:23:29 -07:00
gpu_darwin.go gpu: add 512MiB to darwin minimum, metal doesn't have partial offloading overhead (#4068) 2024-05-01 11:46:03 -04:00
gpu_info.h Add CUDA Driver API for GPU discovery 2024-04-30 18:00:45 -07:00
gpu_info_cpu.c Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu_info_cudart.c Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu_info_cudart.h Add CUDA Driver API for GPU discovery 2024-04-30 18:00:45 -07:00
gpu_info_darwin.h darwin: no partial offloading if required memory greater than system 2024-04-16 11:22:38 -07:00
gpu_info_darwin.m darwin: no partial offloading if required memory greater than system 2024-04-16 11:22:38 -07:00
gpu_info_nvcuda.c Add CUDA Driver API for GPU discovery 2024-04-30 18:00:45 -07:00
gpu_info_nvcuda.h Add CUDA Driver API for GPU discovery 2024-04-30 18:00:45 -07:00
gpu_test.go Request and model concurrency 2024-04-22 19:29:12 -07:00
types.go Request and model concurrency 2024-04-22 19:29:12 -07:00