llama.cpp

History

Andrei da003d8768 Automatically set chat format from gguf (#1110 ) * Use jinja formatter to load chat format from gguf * Fix off-by-one error in metadata loader * Implement chat format auto-detection		2024-01-29 14:22:23 -05:00
..
__init__.py	llama_cpp server: app is now importable, still runnable as a module	2023-04-29 11:41:25 -07:00
__main__.py	[Feat] Multi model support (#931 )	2023-12-22 05:51:25 -05:00
app.py	feat(server): include llama-cpp-python version in openapi spec	2024-01-25 11:23:18 -05:00
cli.py	Fix python3.8 support	2024-01-19 08:17:49 -05:00
errors.py	server: Support none defaulting to infinity for completions (#111 )	2023-12-22 14:05:13 -05:00
model.py	fix: pass chat handler not chat formatter for huggingface autotokenizer and tokenizer_config formats.	2024-01-21 18:38:04 -05:00
settings.py	Automatically set chat format from gguf (#1110 )	2024-01-29 14:22:23 -05:00
types.py	server: Support none defaulting to infinity for completions (#111 )	2023-12-22 14:05:13 -05:00