ollama/gpu
2024-05-06 17:04:19 -07:00
..
amd_common.go Support Fedoras standard ROCm location 2024-05-01 15:47:12 -07:00
amd_hip_windows.go Request and model concurrency 2024-04-22 19:29:12 -07:00
amd_linux.go Support Fedoras standard ROCm location 2024-05-01 15:47:12 -07:00
amd_windows.go Support Fedoras standard ROCm location 2024-05-01 15:47:12 -07:00
assets.go Centralize server config handling 2024-05-05 16:49:50 -07:00
cpu_common.go Mechanical switch from log to slog 2024-01-18 14:12:57 -08:00
cuda_common.go Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu.go llm: add minimum based on layer size 2024-05-06 17:04:19 -07:00
gpu_darwin.go llm: add minimum based on layer size 2024-05-06 17:04:19 -07:00
gpu_info.h Add CUDA Driver API for GPU discovery 2024-04-30 18:00:45 -07:00
gpu_info_cpu.c Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu_info_cudart.c Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu_info_cudart.h Add CUDA Driver API for GPU discovery 2024-04-30 18:00:45 -07:00
gpu_info_darwin.h darwin: no partial offloading if required memory greater than system 2024-04-16 11:22:38 -07:00
gpu_info_darwin.m darwin: no partial offloading if required memory greater than system 2024-04-16 11:22:38 -07:00
gpu_info_nvcuda.c Add CUDA Driver API for GPU discovery 2024-04-30 18:00:45 -07:00
gpu_info_nvcuda.h Add CUDA Driver API for GPU discovery 2024-04-30 18:00:45 -07:00
gpu_test.go Request and model concurrency 2024-04-22 19:29:12 -07:00
types.go Request and model concurrency 2024-04-22 19:29:12 -07:00