ollama


65a f8ef4439e9 Use build tags to generate accelerated binaries for CUDA and ROCm on Linux.	2023-12-19 09:05:46 -08:00

The build tags rocm or cuda must be specified to both go generate and go build. ROCm builds should have both ROCM_PATH set (and the ROCM SDK present) as well as CLBlast installed (for GGML) and CLBlast_DIR set in the environment to the CLBlast cmake directory (likely /usr/lib/cmake/CLBlast).

Build tags are also used to switch VRAM detection between cuda and rocm implementations, using added "accelerator_foo.go" files which contain architecture specific functions and variables. accelerator_none is used when no tags are set, and a helper function addRunner will ignore it if it is the chosen accelerator.

Fix go generate commands, thanks @deadmeu for testing.
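
The tag-selected accelerator pattern described in the commit message can be sketched roughly as follows. This is a single-file illustration under stated assumptions, not the code from this commit: the real addRunner signature is not shown here, so `runnerNames` and `noAccelerator` are hypothetical names, and the accelerator value is passed in as an argument instead of coming from a tag-gated file.

```go
package main

import "fmt"

// In the layout the commit describes, each accelerator_*.go file would be
// guarded by a build constraint (for example `//go:build cuda`) and only one
// of them would be compiled in, chosen with something like:
//
//	go generate -tags cuda ./...
//	go build -tags cuda .
//
// To keep this sketch to one runnable file, the selected accelerator is a
// plain string argument. All names below are hypothetical.
const noAccelerator = "none"

// runnerNames is a hypothetical stand-in for the addRunner helper: it returns
// the runners to try, in order, and skips the placeholder accelerator that is
// selected when no build tag is set.
func runnerNames(accelerator string) []string {
	runners := []string{"cpu"}
	if accelerator != noAccelerator {
		// Prefer the accelerated runner, keep the CPU runner as a fallback.
		runners = append([]string{accelerator}, runners...)
	}
	return runners
}

func main() {
	for _, acc := range []string{"none", "cuda", "rocm"} {
		fmt.Println(acc, runnerNames(acc))
	}
}
```

Passing the accelerator as a value keeps the example self-contained. With real build tags, the same decision would be made at compile time by whichever accelerator file is included.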
llama.cpp	Use build tags to generate accelerated binaries for CUDA and ROCm on Linux.	2023-12-19 09:05:46 -08:00
accelerator_cuda.go	Use build tags to generate accelerated binaries for CUDA and ROCm on Linux.	2023-12-19 09:05:46 -08:00
accelerator_none.go	Use build tags to generate accelerated binaries for CUDA and ROCm on Linux.	2023-12-19 09:05:46 -08:00
accelerator_rocm.go	Use build tags to generate accelerated binaries for CUDA and ROCm on Linux.	2023-12-19 09:05:46 -08:00
ext_server.go	Add cgo implementation for llama.cpp	2023-12-19 09:05:46 -08:00
ggml.go	deprecate ggml	2023-12-19 09:05:46 -08:00
gguf.go	remove per-model types	2023-12-11 09:40:21 -08:00
gpu_cuda.go	Add cgo implementation for llama.cpp	2023-12-19 09:05:46 -08:00
gpu_darwin.go	Add cgo implementation for llama.cpp	2023-12-19 09:05:46 -08:00
llama.go	Add cgo implementation for llama.cpp	2023-12-19 09:05:46 -08:00
llm.go	Add cgo implementation for llama.cpp	2023-12-19 09:05:46 -08:00
utils.go	partial decode ggml bin for more info	2023-08-10 09:23:10 -07:00