ollama/llm
Daniel Hiltgen df54c723ae Make CPU builds parallel and customizable AMD GPUs
The linux build now support parallel CPU builds to speed things up.
This also exposes AMD GPU targets as an optional setting for advaced
users who want to alter our default set.
2024-01-21 15:12:21 -08:00
..
ext_server Add multiple CPU variants for Intel Mac 2024-01-17 15:08:54 -08:00
generate Make CPU builds parallel and customizable AMD GPUs 2024-01-21 15:12:21 -08:00
llama.cpp@584d674be6 Bump llama.cpp to b1842 and add new cuda lib dep 2024-01-16 12:53:52 -08:00
dyn_ext_server.c Switch to local dlopen symbols 2024-01-19 11:37:02 -08:00
dyn_ext_server.go Unlock mutex when failing to load model (#2117) 2024-01-20 20:54:46 -05:00
dyn_ext_server.h Always dynamically load the llm server library 2024-01-11 08:42:47 -08:00
ggml.go add max context length check 2024-01-12 14:54:07 -08:00
gguf.go add max context length check 2024-01-12 14:54:07 -08:00
llama.go remove unused fields and functions 2024-01-09 09:37:40 -08:00
llm.go Mechanical switch from log to slog 2024-01-18 14:12:57 -08:00
payload_common.go use gzip for runner embedding (#2067) 2024-01-19 13:23:03 -05:00
payload_darwin_amd64.go Add multiple CPU variants for Intel Mac 2024-01-17 15:08:54 -08:00
payload_darwin_arm64.go Add multiple CPU variants for Intel Mac 2024-01-17 15:08:54 -08:00
payload_linux.go Add multiple CPU variants for Intel Mac 2024-01-17 15:08:54 -08:00
payload_test.go Fix up the CPU fallback selection 2024-01-11 15:27:06 -08:00
payload_windows.go Add multiple CPU variants for Intel Mac 2024-01-17 15:08:54 -08:00
utils.go partial decode ggml bin for more info 2023-08-10 09:23:10 -07:00