llama.cpp

History

Andrei fe2da09538 feat: Generic Chat Formats, Tool Calling, and Huggingface Pull Support for Multimodal Models (Obsidian, LLaVA1.6, Moondream) (#1147 ) * Test dummy image tags in chat templates * Format and improve types for llava_cpp.py * Add from_pretrained support to llava chat format. * Refactor llava chat format to use a jinja2 * Revert chat format test * Add moondream support (wip) * Update moondream chat format * Update moondream chat format * Update moondream prompt * Add function calling support * Cache last image embed * Add Llava1.6 support * Add nanollava support * Add obisidian support * Remove unnecessary import * Re-order multimodal chat formats * Logits all no longer required for multi-modal models * Update README.md * Update docs * Update README * Fix typo * Update README * Fix typo		2024-04-30 01:35:38 -04:00
..
__init__.py	llama_cpp server: app is now importable, still runnable as a module	2023-04-29 11:41:25 -07:00
__main__.py	feat: Add support for yaml based configs	2024-04-10 02:47:01 -04:00
app.py	feat: add `disable_ping_events` flag (#1257 )	2024-04-17 10:08:19 -04:00
cli.py	Fix python3.8 support	2024-01-19 08:17:49 -05:00
errors.py	misc: Format	2024-02-28 14:27:40 -05:00
model.py	feat: Generic Chat Formats, Tool Calling, and Huggingface Pull Support for Multimodal Models (Obsidian, LLaVA1.6, Moondream) (#1147 )	2024-04-30 01:35:38 -04:00
settings.py	fix: pydantic deprecation warning	2024-04-25 21:21:48 -04:00
types.py	feat: Add logprobs support to chat completions (#1311 )	2024-03-31 13:30:13 -04:00