server
|
Add speculative decoding (#1120)
|
2024-01-31 14:08:14 -05:00 |
__init__.py
|
Bump version
|
2024-01-31 15:10:18 -05:00 |
llama.py
|
Add speculative decoding (#1120)
|
2024-01-31 14:08:14 -05:00 |
llama_cpp.py
|
Update llama.cpp
|
2024-01-31 10:41:42 -05:00 |
llama_speculative.py
|
Add speculative decoding (#1120)
|
2024-01-31 14:08:14 -05:00 |
llama_types.py
|
Add json schema mode (#1122)
|
2024-01-27 16:52:18 -05:00 |
llava_cpp.py
|
Make building llava optional
|
2023-11-28 04:55:21 -05:00 |
py.typed
|
Add py.typed
|
2023-08-11 09:58:48 +02:00 |