* Test dummy image tags in chat templates
* Format and improve types for llava_cpp.py
* Add from_pretrained support to llava chat format
* Refactor llava chat format to use a jinja2 template
* Revert chat format test
* Add moondream support (wip)
* Update moondream chat format
* Update moondream chat format
* Update moondream prompt
* Add function calling support
* Cache last image embed
* Add Llava1.6 support
* Add nanollava support
* Add obsidian support
* Remove unnecessary import
* Re-order multimodal chat formats
* logits_all no longer required for multimodal models
* Update README.md
* Update docs
* Update README
* Fix typo
* Update README
* Fix typo
* llava v1.5 integration
* Point llama.cpp to fork
* Add llava shared library target
* Fix type
* Update llama.cpp
* Add llava api
* Revert changes to llama and llama_cpp
* Update llava example
* Add types for new gpt-4-vision-preview api
* Fix typo
* Update llama.cpp
* Update llama_types to match OpenAI v1 API
* Update ChatCompletionFunction type
* Reorder request parameters
* More API type fixes
* Even More Type Updates
* Add parameter for custom chat_handler to Llama class
* Fix circular import
* Convert to absolute imports
* Fix
* Fix pydantic Json type bug
* Accept list of prompt tokens in create_completion
* Add llava1.5 chat handler
* Add Multimodal notebook
* Clean up examples
* Add server docs
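The "Cache last image embed" change above can be sketched as follows. This is a minimal illustration, not the actual handler code: the class name, the hashing key, and the `compute` callback are all hypothetical stand-ins for however the real chat handler memoizes the most recent image's embedding to avoid re-running the vision encoder on repeated requests.

```python
import hashlib

class ImageEmbedCache:
    """Hypothetical sketch: remember only the *last* image's embedding,
    keyed by a hash of the raw image bytes."""

    def __init__(self):
        self._key = None
        self._embed = None
        self.misses = 0  # counts actual encoder invocations

    def get_embed(self, image_bytes, compute):
        key = hashlib.sha256(image_bytes).hexdigest()
        if key != self._key:
            # New image: recompute and replace the single cached entry.
            self.misses += 1
            self._key = key
            self._embed = compute(image_bytes)
        return self._embed

cache = ImageEmbedCache()
fake_encoder = lambda b: [len(b)] * 4  # stand-in for the real vision encoder
e1 = cache.get_embed(b"img-1", fake_encoder)
e2 = cache.get_embed(b"img-1", fake_encoder)  # same bytes: served from cache
```

A single-entry cache covers the common chat pattern of several follow-up questions about the same image without unbounded memory growth.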
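The "Accept list of prompt tokens in create_completion" change can be sketched with a small normalization step. The helper name and the toy tokenizer below are assumptions for illustration; the point is only that a prompt may arrive either as a string or as an already-tokenized list of ints.

```python
def normalize_prompt(prompt, tokenize):
    """Hypothetical helper: accept either a raw string or a list of
    token ids, returning token ids in both cases."""
    if isinstance(prompt, str):
        return tokenize(prompt)
    return list(prompt)

# Toy tokenizer (one token per character) just to make the sketch runnable.
toy_tokenize = lambda s: [ord(c) for c in s]

tokens_from_str = normalize_prompt("hi", toy_tokenize)
tokens_from_list = normalize_prompt([104, 105], toy_tokenize)
```

Accepting pre-tokenized prompts lets callers skip redundant tokenization, e.g. when reusing a cached prefix.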
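For "Add types for new gpt-4-vision-preview api": the shape being typed is the OpenAI content-part message format, where `content` may be a list of typed parts instead of a plain string. The example URL is a placeholder.

```python
# A user message in the gpt-4-vision-preview content-part format:
# "content" is a list of parts, each tagged with a "type" field.
message = {
    "role": "user",
    "content": [
        {"type": "text", "text": "What is in this image?"},
        {
            "type": "image_url",
            "image_url": {"url": "https://example.com/cat.png"},
        },
    ],
}

part_types = [part["type"] for part in message["content"]]
```

The multimodal chat handlers consume this same structure, so the types apply to both the OpenAI-compatible server and local llava-style models.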
---------
Co-authored-by: Andrei Betlen <abetlen@gmail.com>