ollama

History

Michael Yang 4736391bfb llm: add minimum based on layer size		2024-05-06 17:04:19 -07:00
..
amd_common.go	Support Fedoras standard ROCm location	2024-05-01 15:47:12 -07:00
amd_hip_windows.go	Request and model concurrency	2024-04-22 19:29:12 -07:00
amd_linux.go	Support Fedoras standard ROCm location	2024-05-01 15:47:12 -07:00
amd_windows.go	Support Fedoras standard ROCm location	2024-05-01 15:47:12 -07:00
assets.go	Centralize server config handling	2024-05-05 16:49:50 -07:00
cpu_common.go	Mechanical switch from log to slog	2024-01-18 14:12:57 -08:00
cuda_common.go	Request and model concurrency	2024-04-22 19:29:12 -07:00
gpu.go	llm: add minimum based on layer size	2024-05-06 17:04:19 -07:00
gpu_darwin.go	llm: add minimum based on layer size	2024-05-06 17:04:19 -07:00
gpu_info.h	Add CUDA Driver API for GPU discovery	2024-04-30 18:00:45 -07:00
gpu_info_cpu.c	Request and model concurrency	2024-04-22 19:29:12 -07:00
gpu_info_cudart.c	Request and model concurrency	2024-04-22 19:29:12 -07:00
gpu_info_cudart.h	Add CUDA Driver API for GPU discovery	2024-04-30 18:00:45 -07:00
gpu_info_darwin.h	darwin: no partial offloading if required memory greater than system	2024-04-16 11:22:38 -07:00
gpu_info_darwin.m	darwin: no partial offloading if required memory greater than system	2024-04-16 11:22:38 -07:00
gpu_info_nvcuda.c	Add CUDA Driver API for GPU discovery	2024-04-30 18:00:45 -07:00
gpu_info_nvcuda.h	Add CUDA Driver API for GPU discovery	2024-04-30 18:00:45 -07:00
gpu_test.go	Request and model concurrency	2024-04-22 19:29:12 -07:00
types.go	Request and model concurrency	2024-04-22 19:29:12 -07:00