ollama/gpu
Daniel Hiltgen f56aa20014 Centralize server config handling
This moves all the env var reading into one central module
and logs the loaded config once at startup which should
help in troubleshooting user server logs
2024-05-05 16:49:50 -07:00
..
amd_common.go Request and model concurrency 2024-04-22 19:29:12 -07:00
amd_hip_windows.go Request and model concurrency 2024-04-22 19:29:12 -07:00
amd_linux.go AMD gfx patch rev is hex 2024-04-24 09:43:52 -07:00
amd_windows.go AMD gfx patch rev is hex 2024-04-24 09:43:52 -07:00
assets.go Centralize server config handling 2024-05-05 16:49:50 -07:00
cpu_common.go Mechanical switch from log to slog 2024-01-18 14:12:57 -08:00
cuda_common.go Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu.go Centralize server config handling 2024-05-05 16:49:50 -07:00
gpu_darwin.go gpu: add 512MiB to darwin minimum, metal doesn't have partial offloading overhead (#4068) 2024-05-01 11:46:03 -04:00
gpu_info.h Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu_info_cpu.c Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu_info_cudart.c Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu_info_cudart.h Request and model concurrency 2024-04-22 19:29:12 -07:00
gpu_info_darwin.h darwin: no partial offloading if required memory greater than system 2024-04-16 11:22:38 -07:00
gpu_info_darwin.m darwin: no partial offloading if required memory greater than system 2024-04-16 11:22:38 -07:00
gpu_test.go Request and model concurrency 2024-04-22 19:29:12 -07:00
types.go Request and model concurrency 2024-04-22 19:29:12 -07:00