llama.cpp/llama_cpp/server
Andrei 4d574bd765
feat(server): Add support for pulling models from Huggingface Hub (#1222)
* Basic support for hf pull on server

* Add hf_model_repo_id setting

* Update README
2024-02-26 14:35:08 -05:00
..
__init__.py llama_cpp server: app is now importable, still runnable as a module 2023-04-29 11:41:25 -07:00
__main__.py [Feat] Multi model support (#931) 2023-12-22 05:51:25 -05:00
app.py fix: Use '\n' seperator for EventSourceResponse (#1188) 2024-02-15 15:20:13 -05:00
cli.py Fix python3.8 support 2024-01-19 08:17:49 -05:00
errors.py server: Support none defaulting to infinity for completions (#111) 2023-12-22 14:05:13 -05:00
model.py feat(server): Add support for pulling models from Huggingface Hub (#1222) 2024-02-26 14:35:08 -05:00
settings.py feat(server): Add support for pulling models from Huggingface Hub (#1222) 2024-02-26 14:35:08 -05:00
types.py server: Support none defaulting to infinity for completions (#111) 2023-12-22 14:05:13 -05:00