ollama/llm
65a f8ef4439e9 Use build tags to generate accelerated binaries for CUDA and ROCm on Linux.
The build tags rocm or cuda must be specified to both go generate and go build.
ROCm builds should have both ROCM_PATH set (and the ROCM SDK present) as well
as CLBlast installed (for GGML) and CLBlast_DIR set in the environment to the
CLBlast cmake directory (likely /usr/lib/cmake/CLBlast). Build tags are also
used to switch VRAM detection between cuda and rocm implementations, using
added "accelerator_foo.go" files which contain architecture specific functions
and variables. accelerator_none is used when no tags are set, and a helper
function addRunner will ignore it if it is the chosen accelerator. Fix go
generate commands, thanks @deadmeu for testing.
2023-12-19 09:05:46 -08:00
..
llama.cpp Use build tags to generate accelerated binaries for CUDA and ROCm on Linux. 2023-12-19 09:05:46 -08:00
accelerator_cuda.go Use build tags to generate accelerated binaries for CUDA and ROCm on Linux. 2023-12-19 09:05:46 -08:00
accelerator_none.go Use build tags to generate accelerated binaries for CUDA and ROCm on Linux. 2023-12-19 09:05:46 -08:00
accelerator_rocm.go Use build tags to generate accelerated binaries for CUDA and ROCm on Linux. 2023-12-19 09:05:46 -08:00
ext_server.go Add cgo implementation for llama.cpp 2023-12-19 09:05:46 -08:00
ggml.go deprecate ggml 2023-12-19 09:05:46 -08:00
gguf.go remove per-model types 2023-12-11 09:40:21 -08:00
gpu_cuda.go Add cgo implementation for llama.cpp 2023-12-19 09:05:46 -08:00
gpu_darwin.go Add cgo implementation for llama.cpp 2023-12-19 09:05:46 -08:00
llama.go Add cgo implementation for llama.cpp 2023-12-19 09:05:46 -08:00
llm.go Add cgo implementation for llama.cpp 2023-12-19 09:05:46 -08:00
utils.go partial decode ggml bin for more info 2023-08-10 09:23:10 -07:00