llama.cpp

Dave 12b7f2f4e9 [Feat] Multi model support (#931)	2023-12-22 05:51:25 -05:00

* Update Llama class to handle chat_format & caching
* Add settings.py
* Add util.py & update __main__.py
* multimodel
* update settings.py
* cleanup
* delete util.py
* Fix /v1/models endpoint
* MultiLlama now iterable, app check-alive on "/"
* instant model init if file is given
* backward compability
* revert model param mandatory
* fix error
* handle individual model config json
* refactor
* revert chathandler/clip_model changes
* handle chat_handler in MulitLlama()
* split settings into server/llama
* reduce global vars
* Update LlamaProxy to handle config files
* Add free method to LlamaProxy
* update arg parsers & install server alias
* refactor cache settings
* change server executable name
* better var name
* whitespace
* Revert "whitespace"
  This reverts commit bc5cf51c64a95bfc9926e1bc58166059711a1cd8.
* remove exe_name
* Fix merge bugs
* Fix type annotations
* Fix type annotations
* Fix uvicorn app factory
* Fix settings
* Refactor server
* Remove formatting fix
* Format
* Use default model if not found in model settings
* Fix
* Cleanup
* Fix
* Fix
* Remove unnused CommandLineSettings
* Cleanup
* Support default name for copilot-codex models

---------

Co-authored-by: Andrei Betlen <abetlen@gmail.com>
__init__.py	llama_cpp server: app is now importable, still runnable as a module	2023-04-29 11:41:25 -07:00
__main__.py	[Feat] Multi model support (#931)	2023-12-22 05:51:25 -05:00
app.py	[Feat] Multi model support (#931)	2023-12-22 05:51:25 -05:00
cli.py	[Feat] Multi model support (#931)	2023-12-22 05:51:25 -05:00
errors.py	[Feat] Multi model support (#931)	2023-12-22 05:51:25 -05:00
model.py	[Feat] Multi model support (#931)	2023-12-22 05:51:25 -05:00
settings.py	[Feat] Multi model support (#931)	2023-12-22 05:51:25 -05:00
types.py	[Feat] Multi model support (#931)	2023-12-22 05:51:25 -05:00