ollama

History

Daniel Hiltgen b08870aff3 Merge pull request #4188 from dhiltgen/use_our_lib User our bundled libraries (cuda) instead of the host library		2024-05-06 14:41:05 -07:00
..
amd_common.go	Support Fedoras standard ROCm location	2024-05-01 15:47:12 -07:00
amd_hip_windows.go	Request and model concurrency	2024-04-22 19:29:12 -07:00
amd_linux.go	Support Fedoras standard ROCm location	2024-05-01 15:47:12 -07:00
amd_windows.go	Support Fedoras standard ROCm location	2024-05-01 15:47:12 -07:00
assets.go	Centralize server config handling	2024-05-05 16:49:50 -07:00
cpu_common.go	Mechanical switch from log to slog	2024-01-18 14:12:57 -08:00
cuda_common.go	Request and model concurrency	2024-04-22 19:29:12 -07:00
gpu.go	Use our libraries first	2024-05-06 14:23:29 -07:00
gpu_darwin.go	gpu: add 512MiB to darwin minimum, metal doesn't have partial offloading overhead (#4068 )	2024-05-01 11:46:03 -04:00
gpu_info.h	Add CUDA Driver API for GPU discovery	2024-04-30 18:00:45 -07:00
gpu_info_cpu.c	Request and model concurrency	2024-04-22 19:29:12 -07:00
gpu_info_cudart.c	Request and model concurrency	2024-04-22 19:29:12 -07:00
gpu_info_cudart.h	Add CUDA Driver API for GPU discovery	2024-04-30 18:00:45 -07:00
gpu_info_darwin.h	darwin: no partial offloading if required memory greater than system	2024-04-16 11:22:38 -07:00
gpu_info_darwin.m	darwin: no partial offloading if required memory greater than system	2024-04-16 11:22:38 -07:00
gpu_info_nvcuda.c	Add CUDA Driver API for GPU discovery	2024-04-30 18:00:45 -07:00
gpu_info_nvcuda.h	Add CUDA Driver API for GPU discovery	2024-04-30 18:00:45 -07:00
gpu_test.go	Request and model concurrency	2024-04-22 19:29:12 -07:00
types.go	Request and model concurrency	2024-04-22 19:29:12 -07:00