llama.cpp

ddh0 c96b2daebf feat: Use all available CPUs for batch processing (#1345)	2024-04-17 10:05:54 -04:00
__init__.py	llama_cpp server: app is now importable, still runnable as a module	2023-04-29 11:41:25 -07:00
__main__.py	feat: Add support for yaml based configs	2024-04-10 02:47:01 -04:00
app.py	feat: Add support for yaml based configs	2024-04-10 02:47:01 -04:00
cli.py	Fix python3.8 support	2024-01-19 08:17:49 -05:00
errors.py	misc: Format	2024-02-28 14:27:40 -05:00
model.py	feat: add support for KV cache quantization options (#1307)	2024-04-01 10:19:28 -04:00
settings.py	feat: Use all available CPUs for batch processing (#1345)	2024-04-17 10:05:54 -04:00
types.py	feat: Add logprobs support to chat completions (#1311)	2024-03-31 13:30:13 -04:00