llama.cpp

History

Kyle Mistele 9c36688b33 fix(cli): allow passing n_ctx=0 to openAI API server args to use model n_ctx_train field per #1015 (#1093 )		2024-01-16 18:54:06 -05:00
..
__init__.py	llama_cpp server: app is now importable, still runnable as a module	2023-04-29 11:41:25 -07:00
__main__.py	[Feat] Multi model support (#931 )	2023-12-22 05:51:25 -05:00
app.py	Support Accept text/event-stream in chat and completion endpoints, resolves #1083 (#1088 )	2024-01-16 12:52:52 -05:00
cli.py	[Feat] Multi model support (#931 )	2023-12-22 05:51:25 -05:00
errors.py	server: Support none defaulting to infinity for completions (#111 )	2023-12-22 14:05:13 -05:00
model.py	Implement GGUF metadata KV overrides (#1011 )	2024-01-15 12:29:29 -05:00
settings.py	fix(cli): allow passing n_ctx=0 to openAI API server args to use model n_ctx_train field per #1015 (#1093 )	2024-01-16 18:54:06 -05:00
types.py	server: Support none defaulting to infinity for completions (#111 )	2023-12-22 14:05:13 -05:00