f1c631dc53
If `n_ctx` is set to 0, the code should use the maximum context length of the selected model, but this did not work: there was a problem with the initialization of this parameter, and a related problem with `n_batch`.
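The intended behavior can be sketched as a small helper. This is a minimal illustration, not the library's actual code: the function name `resolve_context_params` and its parameters are hypothetical, standing in for the initialization logic that maps `n_ctx == 0` to the model's trained context length and keeps `n_batch` consistent with it.

```python
def resolve_context_params(n_ctx: int, n_batch: int, model_max_ctx: int) -> tuple[int, int]:
    """Resolve context-window parameters at model load time.

    n_ctx == 0 is a sentinel meaning "use the model's maximum
    (trained) context length". n_batch must never exceed the
    resolved context size, so it is clamped accordingly.
    (Hypothetical helper for illustration only.)
    """
    resolved_ctx = model_max_ctx if n_ctx == 0 else n_ctx
    resolved_batch = min(n_batch, resolved_ctx)
    return resolved_ctx, resolved_batch


# With the default sentinel, the model's own limit is used:
print(resolve_context_params(0, 512, 4096))     # (4096, 512)
# An explicit n_ctx overrides the model's limit:
print(resolve_context_params(8192, 512, 4096))  # (8192, 512)
```

The bug described above amounts to the sentinel not being honored during initialization, so a user passing `n_ctx=0` did not get the model's maximum context length.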