93ac3760cb
If there are any pending reponses (such as from potential stop tokens) then we should send them back before ending the sequence. Otherwise, we can be missing tokens at the end of a response. Fixes #6707 |
||
---|---|---|
.. | ||
ext_server | ||
generate | ||
llama.cpp@8962422b1c | ||
patches | ||
filetype.go | ||
ggla.go | ||
ggml.go | ||
ggml_test.go | ||
gguf.go | ||
llm.go | ||
llm_darwin_amd64.go | ||
llm_darwin_arm64.go | ||
llm_linux.go | ||
llm_windows.go | ||
memory.go | ||
memory_test.go | ||
payload.go | ||
server.go | ||
status.go |