baalajimaestro/llama.cpp

Author	SHA1	Message	Date
Andrei	fe2da09538	feat: Generic Chat Formats, Tool Calling, and Huggingface Pull Support for Multimodal Models (Obsidian, LLaVA1.6, Moondream) (#1147 ) * Test dummy image tags in chat templates * Format and improve types for llava_cpp.py * Add from_pretrained support to llava chat format. * Refactor llava chat format to use a jinja2 * Revert chat format test * Add moondream support (wip) * Update moondream chat format * Update moondream chat format * Update moondream prompt * Add function calling support * Cache last image embed * Add Llava1.6 support * Add nanollava support * Add obisidian support * Remove unnecessary import * Re-order multimodal chat formats * Logits all no longer required for multi-modal models * Update README.md * Update docs * Update README * Fix typo * Update README * Fix typo	2024-04-30 01:35:38 -04:00
Jeffrey Fong	bce6dc0ac2	docs: Update Functionary OpenAI Server Readme (#1193 ) * update functionary parts in server readme * add write-up about hf tokenizer	2024-02-23 12:24:10 -05:00
Andrei Betlen	14191e9036	docs: Add create_chat_completion_openai_v1 to api reference	2024-02-21 16:26:49 -05:00
Andrei	0f8aa4ab5c	feat: Pull models directly from huggingface (#1206 ) * Add from_pretrained method to Llama class * Update docs * Merge filename and pattern	2024-02-21 16:25:10 -05:00
Andrei Betlen	03ed547bfd	Remove templates doc	2024-01-18 21:23:26 -05:00
Austin	6bfe98bd80	Integration of Jinja2 Templating (#875 ) * feat: Add support for jinja templating Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com> * fix: Refactor chat formatter and update interface for jinja templates - Simplify the `llama2_template` in `llama_jinja_format.py` by removing unnecessary line breaks for readability without affecting functionality. - Update `ChatFormatterInterface` constructor to accept a more generic `Optional[object]` type for the template parameter, enhancing flexibility. - Introduce a `template` property to `ChatFormatterInterface` for standardized access to the template string. - Replace `MetaSingleton` metaclass with `Singleton` for the `ChatFormatterFactory` to streamline the singleton implementation. These changes enhance code readability, maintain usability, and ensure consistency in the chat formatter's design pattern usage. * Add outline for Jinja2 templating integration documentation Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com> * Add jinja2 as a dependency with version range for Hugging Face transformers compatibility Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com> * Update jinja2 version constraint for mkdocs-material compatibility Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com> * Fix attribute name in AutoChatFormatter - Changed attribute name from `self._renderer` to `self._environment` --------- Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com>	2024-01-17 09:47:52 -05:00
Andrei Betlen	522aecb868	docs: add server config docs	2023-12-22 14:37:24 -05:00
Andrei Betlen	174ef3ddf6	docs: Add headings to API reference	2023-11-27 18:42:15 -05:00
Andrei Betlen	945e20fa2c	docs: update link	2023-11-24 00:18:32 -05:00
Andrei Betlen	e6a36b840e	docs: edit function calling docs	2023-11-24 00:17:54 -05:00
Andrei Betlen	19e02f1f87	docs: Add link to function calling notebook	2023-11-24 00:15:02 -05:00
Andrei Betlen	21abefa488	docs: Add grammar and types to api reference	2023-11-23 00:27:41 -05:00
Andrei Betlen	e349f314b4	docs: Fix API Reference page	2023-11-22 23:45:02 -05:00
Andrei Betlen	841f6167cc	Add Code Completion section to docs	2023-11-10 04:06:14 -05:00
Andrei Betlen	82072802ea	Add link to bakllava gguf model	2023-11-09 03:05:18 -05:00
Kevin Jung	fb1f956a27	Fix server doc arguments (#892 )	2023-11-08 23:53:00 -05:00
Andrei Betlen	9ae9c86be0	Update server docs	2023-11-08 00:52:13 -05:00
Andrei Betlen	3660230faa	Fix docs multi-modal docs	2023-11-07 22:52:08 -05:00
Damian Stewart	aab74f0b2b	Multimodal Support (Llava 1.5) (#821 ) * llava v1.5 integration * Point llama.cpp to fork * Add llava shared library target * Fix type * Update llama.cpp * Add llava api * Revert changes to llama and llama_cpp * Update llava example * Add types for new gpt-4-vision-preview api * Fix typo * Update llama.cpp * Update llama_types to match OpenAI v1 API * Update ChatCompletionFunction type * Reorder request parameters * More API type fixes * Even More Type Updates * Add parameter for custom chat_handler to Llama class * Fix circular import * Convert to absolute imports * Fix * Fix pydantic Jsontype bug * Accept list of prompt tokens in create_completion * Add llava1.5 chat handler * Add Multimodal notebook * Clean up examples * Add server docs --------- Co-authored-by: Andrei Betlen <abetlen@gmail.com>	2023-11-07 22:48:51 -05:00
ccshen	b76724cddc	Update instruction to download GGUF model (#783 ) Co-authored-by: john.shen <john.shen@bioclinica.com>	2023-10-02 11:46:47 -04:00
Andrei Betlen	109123c4f0	docs: Use pymdownx.snippets for easier docs management	2023-09-12 22:28:58 -04:00
Andrei Betlen	1dd3f473c0	Remove references to FORCE_CMAKE	2023-09-12 19:01:16 -04:00
Andrei Betlen	57db1f9570	Update development docs for scikit-build-core. Closes #490	2023-07-18 20:26:25 -04:00
Audrey Roy Greenfeld	d270ec231a	Update macOS Metal GPU step 4 * Update "today" to version 0.1.62 * Fix numbering (there were 2 step 4's)	2023-07-07 11:15:04 +01:00
Andrei Betlen	442213b070	Add stopping criteria and logits processor to docs	2023-06-28 21:07:58 -04:00
Andrei Betlen	a3766591bb	Update docs	2023-06-27 13:02:30 -04:00
Andrei Betlen	dae983342a	Update docs	2023-06-27 12:45:31 -04:00
Andrei Betlen	66b8b979a5	Update readthedocs setup	2023-06-26 16:31:16 -04:00
Andrei Betlen	37d5192a92	Update docs	2023-06-16 10:41:51 -04:00
Ian Scrivener	94f63a66b9	Create macos_install.md add MacOS Metal markdown install instructions	2023-06-13 09:49:19 +10:00
Andrei Betlen	db10e0078b	Update docs	2023-05-17 16:14:01 -04:00
Andrei Betlen	b75fa96bf7	Update docs	2023-04-24 19:56:57 -04:00
Andrei Betlen	b2a24bddac	Update docs	2023-04-15 22:31:14 -04:00
Andrei Betlen	196650ccb2	Update model paths to be more clear they should point to file	2023-04-09 22:45:55 -04:00
Andrei Betlen	38f7dea6ca	Update README and docs	2023-04-05 17:44:25 -04:00
Andrei Betlen	4615f1e520	Add chat completion method to docs	2023-04-03 20:14:03 -04:00
Andrei Betlen	c0a5c0171f	Add embed back into documentation	2023-04-03 18:53:00 -04:00
Andrei Betlen	2bc184dc63	Add new methods to docs	2023-04-02 00:09:51 -04:00
Andrei Betlen	f14a31c936	Document generate method	2023-04-01 17:29:43 -04:00
Andrei Betlen	67c70cc8eb	Add static methods for beginning and end of sequence tokens.	2023-04-01 17:29:30 -04:00
Andrei Betlen	0503e7f9b4	Update api	2023-04-01 13:04:12 -04:00
Andrei Betlen	9f975ac44c	Add development section	2023-04-01 13:03:56 -04:00
Andrei Betlen	ea41474e04	Add new Llama methods to docs	2023-03-28 05:04:15 -04:00
Andrei Betlen	5533ed7aa8	Update docs	2023-03-24 19:02:36 -04:00
Andrei Betlen	df15caa877	Add mkdocs	2023-03-24 18:57:59 -04:00

45 commits