Commit graph

12 commits

Author SHA1 Message Date
Andrei
fe2da09538
feat: Generic Chat Formats, Tool Calling, and Huggingface Pull Support for Multimodal Models (Obsidian, LLaVA1.6, Moondream) (#1147)
* Test dummy image tags in chat templates

* Format and improve  types for llava_cpp.py

* Add from_pretrained support to llava chat format.

* Refactor llava chat format to use a jinja2

* Revert chat format test

* Add moondream support (wip)

* Update moondream chat format

* Update moondream chat format

* Update moondream prompt

* Add function calling support

* Cache last image embed

* Add Llava1.6 support

* Add nanollava support

* Add obisidian support

* Remove unnecessary import

* Re-order multimodal chat formats

* Logits all no longer required for multi-modal models

* Update README.md

* Update docs

* Update README

* Fix typo

* Update README

* Fix typo
2024-04-30 01:35:38 -04:00
Jeffrey Fong
bce6dc0ac2
docs: Update Functionary OpenAI Server Readme (#1193)
* update functionary parts in server readme

* add write-up about hf tokenizer
2024-02-23 12:24:10 -05:00
Andrei Betlen
522aecb868 docs: add server config docs 2023-12-22 14:37:24 -05:00
Andrei Betlen
945e20fa2c docs: update link 2023-11-24 00:18:32 -05:00
Andrei Betlen
e6a36b840e docs: edit function calling docs 2023-11-24 00:17:54 -05:00
Andrei Betlen
19e02f1f87 docs: Add link to function calling notebook 2023-11-24 00:15:02 -05:00
Andrei Betlen
841f6167cc Add Code Completion section to docs 2023-11-10 04:06:14 -05:00
Andrei Betlen
82072802ea Add link to bakllava gguf model 2023-11-09 03:05:18 -05:00
Kevin Jung
fb1f956a27
Fix server doc arguments (#892) 2023-11-08 23:53:00 -05:00
Andrei Betlen
9ae9c86be0 Update server docs 2023-11-08 00:52:13 -05:00
Andrei Betlen
3660230faa Fix docs multi-modal docs 2023-11-07 22:52:08 -05:00
Damian Stewart
aab74f0b2b
Multimodal Support (Llava 1.5) (#821)
* llava v1.5 integration

* Point llama.cpp to fork

* Add llava shared library target

* Fix type

* Update llama.cpp

* Add llava api

* Revert changes to llama and llama_cpp

* Update llava example

* Add types for new gpt-4-vision-preview api

* Fix typo

* Update llama.cpp

* Update llama_types to match OpenAI v1 API

* Update ChatCompletionFunction type

* Reorder request parameters

* More API type fixes

* Even More Type Updates

* Add parameter for custom chat_handler to Llama class

* Fix circular import

* Convert to absolute imports

* Fix

* Fix pydantic Jsontype bug

* Accept list of prompt tokens in create_completion

* Add llava1.5 chat handler

* Add Multimodal notebook

* Clean up examples

* Add server docs

---------

Co-authored-by: Andrei Betlen <abetlen@gmail.com>
2023-11-07 22:48:51 -05:00