ollama/gpu
Michael Yang 91b3e4d282 update memory calcualtions
count each layer independently when deciding gpu offloading
2024-04-01 13:16:32 -07:00
..
amd_common.go Fix iGPU detection for linux 2024-03-12 16:57:19 -07:00
amd_hip_windows.go Revamp ROCm support 2024-03-07 10:36:50 -08:00
amd_linux.go Update troubleshooting link 2024-03-28 12:05:26 -07:00
amd_windows.go Finish unwinding idempotent payload logic 2024-03-09 08:34:39 -08:00
assets.go Better tmpdir cleanup 2024-03-20 16:03:19 +01:00
cpu_common.go Mechanical switch from log to slog 2024-01-18 14:12:57 -08:00
gpu.go update memory calcualtions 2024-04-01 13:16:32 -07:00
gpu_darwin.go Allow setting max vram for workarounds 2024-03-06 17:15:06 -08:00
gpu_info.h add support for libcudart.so for CUDA devices (adds Jetson support) 2024-03-25 11:07:44 -04:00
gpu_info_cpu.c calculate overhead based number of gpu devices (#1875) 2024-01-09 15:53:33 -05:00
gpu_info_cudart.c add support for libcudart.so for CUDA devices (adds Jetson support) 2024-03-25 11:07:44 -04:00
gpu_info_cudart.h add support for libcudart.so for CUDA devices (adds Jetson support) 2024-03-25 11:07:44 -04:00
gpu_info_darwin.h Determine max VRAM on macOS using recommendedMaxWorkingSetSize (#2354) 2024-02-25 18:16:45 -05:00
gpu_info_darwin.m Determine max VRAM on macOS using recommendedMaxWorkingSetSize (#2354) 2024-02-25 18:16:45 -05:00
gpu_info_nvml.c add support for libcudart.so for CUDA devices (adds Jetson support) 2024-03-25 11:07:44 -04:00
gpu_info_nvml.h add support for libcudart.so for CUDA devices (adds Jetson support) 2024-03-25 11:07:44 -04:00
gpu_test.go Merge pull request #1819 from dhiltgen/multi_variant 2024-01-11 14:00:48 -08:00
types.go update memory calcualtions 2024-04-01 13:16:32 -07:00