llama.cpp/llama_cpp
Daniele Morotti f1c631dc53
Bug fixed with n_ctx=0 (#1015)
If the n_ctx is set to 0 the code should use the maximum context length of the selected model, but it didn't work. There was a problem with the initialization of this parameter and a related problem with 'n_batch'.
2023-12-16 18:59:50 -05:00
..
server Add support for running the server with SSL (#994) 2023-12-11 20:47:11 -05:00
__init__.py Bump version 2023-12-13 21:52:29 -05:00
_utils.py Fix UnsupportedOperation: fileno in suppress_stdout_stderr (#961) 2023-12-11 20:44:51 -05:00
llama.py Bug fixed with n_ctx=0 (#1015) 2023-12-16 18:59:50 -05:00
llama_chat_format.py Add qwen chat format (#1005) 2023-12-13 21:43:43 -05:00
llama_cpp.py Update llama.cpp 2023-12-16 18:57:43 -05:00
llama_grammar.py Add from_json_schema to LlamaGrammar 2023-11-23 00:27:00 -05:00
llama_types.py Add missing tool_calls finish_reason 2023-11-10 02:51:06 -05:00
llava_cpp.py Make building llava optional 2023-11-28 04:55:21 -05:00
py.typed Add py.typed 2023-08-11 09:58:48 +02:00