Andrei Betlen
|
dd22010e85
|
fix: Raise exceptions when llama model or context fails to load
|
2024-02-22 00:09:45 -05:00 |
|
Andrei
|
7f51b6071f
|
feat(low-level-api): Improve API static type-safety and performance (#1205)
|
2024-02-21 16:25:38 -05:00 |
|
Douglas Hanley
|
d7a67917ba
|
feat: Support batch embeddings (#1186)
* handle batched embeddings
* fix normalization issue
* fix type hints, ensure no breaking changes to embed
* Clear kv cache / reset internal state after embedding complete
---------
Co-authored-by: Andrei <abetlen@gmail.com>
|
2024-02-14 04:26:09 -05:00 |
|
Andrei Betlen
|
6943bab6d8
|
fix: destructor exception where internal classes are missing some uninitialized attributes
|
2024-02-14 03:38:41 -05:00 |
|
Andrei Betlen
|
59760c85ed
|
fix: Use llama_log_callback to avoid suppress_stdout_stderr
|
2024-02-05 21:52:12 -05:00 |
|
Andrei
|
da003d8768
|
Automatically set chat format from gguf (#1110)
* Use jinja formatter to load chat format from gguf
* Fix off-by-one error in metadata loader
* Implement chat format auto-detection
|
2024-01-29 14:22:23 -05:00 |
|
Andrei Betlen
|
5a34c57e54
|
feat: Expose gguf model metadata in metadata property
|
2024-01-19 10:46:03 -05:00 |
|
Andrei Betlen
|
cc4630e66f
|
Move helper classes to _internals submodule
|
2024-01-17 09:14:00 -05:00 |
|