d4cd695759
Run the server.cpp directly inside the Go runtime via cgo while retaining the LLM Go abstractions. |
||
---|---|---|
.. | ||
llama.cpp | ||
ext_server.go | ||
ggml.go | ||
gguf.go | ||
gpu_cuda.go | ||
gpu_darwin.go | ||
llama.go | ||
llm.go | ||
utils.go |
d4cd695759
Run the server.cpp directly inside the Go runtime via cgo while retaining the LLM Go abstractions. |
||
---|---|---|
.. | ||
llama.cpp | ||
ext_server.go | ||
ggml.go | ||
gguf.go | ||
gpu_cuda.go | ||
gpu_darwin.go | ||
llama.go | ||
llm.go | ||
utils.go |