Commit graph

42 commits

Author SHA1 Message Date
Andrei
0f8aa4ab5c
feat: Pull models directly from huggingface (#1206)
* Add from_pretrained method to Llama class

* Update docs

* Merge filename and pattern
2024-02-21 16:25:10 -05:00
Andrei Betlen
03ed547bfd Remove templates doc 2024-01-18 21:23:26 -05:00
Austin
6bfe98bd80
Integration of Jinja2 Templating (#875)
* feat: Add support for jinja templating

Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com>

* fix: Refactor chat formatter and update interface for jinja templates

- Simplify the `llama2_template` in `llama_jinja_format.py` by removing unnecessary line breaks for readability without affecting functionality.
- Update `ChatFormatterInterface` constructor to accept a more generic `Optional[object]` type for the template parameter, enhancing flexibility.
- Introduce a `template` property to `ChatFormatterInterface` for standardized access to the template string.
- Replace `MetaSingleton` metaclass with `Singleton` for the `ChatFormatterFactory` to streamline the singleton implementation.

These changes enhance code readability, maintain usability, and ensure consistency in the chat formatter's design pattern usage.

* Add outline for Jinja2 templating integration documentation

Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com>

* Add jinja2 as a dependency with version range for Hugging Face transformers compatibility

Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com>

* Update jinja2 version constraint for mkdocs-material compatibility

Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com>

* Fix attribute name in AutoChatFormatter

- Changed attribute name from `self._renderer` to `self._environment`

---------

Signed-off-by: teleprint-me <77757836+teleprint-me@users.noreply.github.com>
2024-01-17 09:47:52 -05:00
Andrei Betlen
522aecb868 docs: add server config docs 2023-12-22 14:37:24 -05:00
Andrei Betlen
174ef3ddf6 docs: Add headings to API reference 2023-11-27 18:42:15 -05:00
Andrei Betlen
945e20fa2c docs: update link 2023-11-24 00:18:32 -05:00
Andrei Betlen
e6a36b840e docs: edit function calling docs 2023-11-24 00:17:54 -05:00
Andrei Betlen
19e02f1f87 docs: Add link to function calling notebook 2023-11-24 00:15:02 -05:00
Andrei Betlen
21abefa488 docs: Add grammar and types to api reference 2023-11-23 00:27:41 -05:00
Andrei Betlen
e349f314b4 docs: Fix API Reference page 2023-11-22 23:45:02 -05:00
Andrei Betlen
841f6167cc Add Code Completion section to docs 2023-11-10 04:06:14 -05:00
Andrei Betlen
82072802ea Add link to bakllava gguf model 2023-11-09 03:05:18 -05:00
Kevin Jung
fb1f956a27
Fix server doc arguments (#892) 2023-11-08 23:53:00 -05:00
Andrei Betlen
9ae9c86be0 Update server docs 2023-11-08 00:52:13 -05:00
Andrei Betlen
3660230faa Fix docs multi-modal docs 2023-11-07 22:52:08 -05:00
Damian Stewart
aab74f0b2b
Multimodal Support (Llava 1.5) (#821)
* llava v1.5 integration

* Point llama.cpp to fork

* Add llava shared library target

* Fix type

* Update llama.cpp

* Add llava api

* Revert changes to llama and llama_cpp

* Update llava example

* Add types for new gpt-4-vision-preview api

* Fix typo

* Update llama.cpp

* Update llama_types to match OpenAI v1 API

* Update ChatCompletionFunction type

* Reorder request parameters

* More API type fixes

* Even More Type Updates

* Add parameter for custom chat_handler to Llama class

* Fix circular import

* Convert to absolute imports

* Fix

* Fix pydantic Jsontype bug

* Accept list of prompt tokens in create_completion

* Add llava1.5 chat handler

* Add Multimodal notebook

* Clean up examples

* Add server docs

---------

Co-authored-by: Andrei Betlen <abetlen@gmail.com>
2023-11-07 22:48:51 -05:00
ccshen
b76724cddc
Update instruction to download GGUF model (#783)
Co-authored-by: john.shen <john.shen@bioclinica.com>
2023-10-02 11:46:47 -04:00
Andrei Betlen
109123c4f0 docs: Use pymdownx.snippets for easier docs management 2023-09-12 22:28:58 -04:00
Andrei Betlen
1dd3f473c0 Remove references to FORCE_CMAKE 2023-09-12 19:01:16 -04:00
Andrei Betlen
57db1f9570 Update development docs for scikit-build-core. Closes #490 2023-07-18 20:26:25 -04:00
Audrey Roy Greenfeld
d270ec231a Update macOS Metal GPU step 4
* Update "today" to version 0.1.62
* Fix numbering (there were 2 step 4's)
2023-07-07 11:15:04 +01:00
Andrei Betlen
442213b070 Add stopping criteria and logits processor to docs 2023-06-28 21:07:58 -04:00
Andrei Betlen
a3766591bb Update docs 2023-06-27 13:02:30 -04:00
Andrei Betlen
dae983342a Update docs 2023-06-27 12:45:31 -04:00
Andrei Betlen
66b8b979a5 Update readthedocs setup 2023-06-26 16:31:16 -04:00
Andrei Betlen
37d5192a92 Update docs 2023-06-16 10:41:51 -04:00
Ian Scrivener
94f63a66b9
Create macos_install.md
add MacOS Metal markdown install instructions
2023-06-13 09:49:19 +10:00
Andrei Betlen
db10e0078b Update docs 2023-05-17 16:14:01 -04:00
Andrei Betlen
b75fa96bf7 Update docs 2023-04-24 19:56:57 -04:00
Andrei Betlen
b2a24bddac Update docs 2023-04-15 22:31:14 -04:00
Andrei Betlen
196650ccb2 Update model paths to be more clear they should point to file 2023-04-09 22:45:55 -04:00
Andrei Betlen
38f7dea6ca Update README and docs 2023-04-05 17:44:25 -04:00
Andrei Betlen
4615f1e520 Add chat completion method to docs 2023-04-03 20:14:03 -04:00
Andrei Betlen
c0a5c0171f Add embed back into documentation 2023-04-03 18:53:00 -04:00
Andrei Betlen
2bc184dc63 Add new methods to docs 2023-04-02 00:09:51 -04:00
Andrei Betlen
f14a31c936 Document generate method 2023-04-01 17:29:43 -04:00
Andrei Betlen
67c70cc8eb Add static methods for beginning and end of sequence tokens. 2023-04-01 17:29:30 -04:00
Andrei Betlen
0503e7f9b4 Update api 2023-04-01 13:04:12 -04:00
Andrei Betlen
9f975ac44c Add development section 2023-04-01 13:03:56 -04:00
Andrei Betlen
ea41474e04 Add new Llama methods to docs 2023-03-28 05:04:15 -04:00
Andrei Betlen
5533ed7aa8 Update docs 2023-03-24 19:02:36 -04:00
Andrei Betlen
df15caa877 Add mkdocs 2023-03-24 18:57:59 -04:00