Andrei Betlen
|
c25b7dfc86
|
Bump version
|
2023-04-01 13:06:05 -04:00 |
|
Andrei Betlen
|
ed6f2a049e
|
Add streaming and embedding endpoints to fastapi example
|
2023-04-01 13:05:20 -04:00 |
|
Andrei Betlen
|
0503e7f9b4
|
Update api
|
2023-04-01 13:04:12 -04:00 |
|
Andrei Betlen
|
9f975ac44c
|
Add development section
|
2023-04-01 13:03:56 -04:00 |
|
Andrei Betlen
|
9fac0334b2
|
Update embedding example to new api
|
2023-04-01 13:02:51 -04:00 |
|
Andrei Betlen
|
5e011145c5
|
Update low level api example
|
2023-04-01 13:02:10 -04:00 |
|
Andrei Betlen
|
5f2e822b59
|
Rename inference example
|
2023-04-01 13:01:45 -04:00 |
|
Andrei Betlen
|
318eae237e
|
Update high-level api
|
2023-04-01 13:01:27 -04:00 |
|
Andrei Betlen
|
3af274cbd4
|
Update llama.cpp
|
2023-04-01 13:00:09 -04:00 |
|
Andrei Betlen
|
69e7d9f60e
|
Add type definitions
|
2023-04-01 12:59:58 -04:00 |
|
Andrei Betlen
|
49c8df369a
|
Fix type signature of token_to_str
|
2023-03-31 03:25:12 -04:00 |
|
Andrei Betlen
|
670d390001
|
Fix ctypes typing issue for Arrays
|
2023-03-31 03:20:15 -04:00 |
|
Andrei Betlen
|
1545b22727
|
Fix array type signatures
|
2023-03-31 02:08:20 -04:00 |
|
Andrei Betlen
|
4b9eb5c19e
|
Add search to mkdocs
|
2023-03-31 00:01:53 -04:00 |
|
Andrei Betlen
|
f5e03805f7
|
Update llama.cpp
|
2023-03-31 00:00:43 -04:00 |
|
Andrei Betlen
|
c928e0afc8
|
Formatting
|
2023-03-31 00:00:27 -04:00 |
|
Andrei Betlen
|
8d9560ed66
|
Add typing-extensions dependency
|
2023-03-30 06:43:31 -04:00 |
|
Andrei Betlen
|
a596362c44
|
Add minimum python version, typing-extensions dependency, and long description for PyPI
|
2023-03-30 06:42:54 -04:00 |
|
Andrei Betlen
|
51a92b5146
|
Bump version
|
2023-03-28 21:10:49 -04:00 |
|
Andrei Betlen
|
8908f4614c
|
Update llama.cpp
|
2023-03-28 21:10:23 -04:00 |
|
Andrei Betlen
|
ea41474e04
|
Add new Llama methods to docs
|
2023-03-28 05:04:15 -04:00 |
|
Andrei Betlen
|
f11e9ae939
|
Bump version
|
2023-03-28 05:00:31 -04:00 |
|
Andrei Betlen
|
70b8a1ef75
|
Add support to get embeddings from high-level api. Closes #4
|
2023-03-28 04:59:54 -04:00 |
|
Andrei Betlen
|
9ba5c3c3b7
|
Bump version
|
2023-03-28 04:04:35 -04:00 |
|
Andrei Betlen
|
3dbb3fd3f6
|
Add support for stream parameter. Closes #1
|
2023-03-28 04:03:57 -04:00 |
|
Andrei Betlen
|
30fc0f3866
|
Extract generate method
|
2023-03-28 02:42:22 -04:00 |
|
Andrei Betlen
|
1c823f6d0f
|
Refactor Llama class and add tokenize / detokenize methods Closes #3
|
2023-03-28 01:45:37 -04:00 |
|
Andrei Betlen
|
6dbff7679c
|
Add docs link
|
2023-03-27 18:30:12 -04:00 |
|
Andrei Betlen
|
c210635c9b
|
Update llama.cpp
|
2023-03-27 01:35:51 -04:00 |
|
Andrei Betlen
|
0ea84df91c
|
Update llama.cpp
|
2023-03-26 14:00:37 -04:00 |
|
Andrei Betlen
|
4250380c0a
|
Bump version
|
2023-03-25 16:26:35 -04:00 |
|
Andrei Betlen
|
8ae3beda9c
|
Update Llama to add params
|
2023-03-25 16:26:23 -04:00 |
|
Andrei Betlen
|
4525236214
|
Update llama.cpp
|
2023-03-25 16:26:03 -04:00 |
|
Andrei Betlen
|
b121b7c05b
|
Update docstring
|
2023-03-25 12:33:18 -04:00 |
|
Andrei Betlen
|
206efa39df
|
Bump version
|
2023-03-25 12:12:39 -04:00 |
|
Andrei Betlen
|
fa92740a10
|
Update llama.cpp
|
2023-03-25 12:12:09 -04:00 |
|
Andrei Betlen
|
dfe8608096
|
Update examples
|
2023-03-24 19:10:31 -04:00 |
|
Andrei Betlen
|
5533ed7aa8
|
Update docs
|
2023-03-24 19:02:36 -04:00 |
|
Andrei Betlen
|
cbf8a62b64
|
Add repo url
|
2023-03-24 18:59:02 -04:00 |
|
Andrei Betlen
|
df15caa877
|
Add mkdocs
|
2023-03-24 18:57:59 -04:00 |
|
Andrei Betlen
|
a61fd3b509
|
Add example based on stripped down version of main.cpp from llama.cpp
|
2023-03-24 18:57:25 -04:00 |
|
Andrei Betlen
|
da9b71cfe5
|
Bump version
|
2023-03-24 18:44:04 -04:00 |
|
Andrei Betlen
|
4da5faa28b
|
Bugfix: cross-platform method to find shared lib
|
2023-03-24 18:43:29 -04:00 |
|
Andrei Betlen
|
b93675608a
|
Handle errors returned by llama.cpp
|
2023-03-24 15:47:17 -04:00 |
|
Andrei Betlen
|
bcde1f19b7
|
Bump version
|
2023-03-24 15:00:10 -04:00 |
|
Andrei Betlen
|
7786edb0f9
|
Black formatting
|
2023-03-24 14:59:29 -04:00 |
|
Andrei Betlen
|
c784d83131
|
Update llama.cpp and re-organize low-level api
|
2023-03-24 14:58:42 -04:00 |
|
Andrei Betlen
|
b9c53b88a1
|
Use n_ctx provided from actual context not params
|
2023-03-24 14:58:10 -04:00 |
|
Andrei Betlen
|
2cc499512c
|
Black formatting
|
2023-03-24 14:35:41 -04:00 |
|
Andrei Betlen
|
d29b05bb67
|
Update example to match alpaca training prompt
|
2023-03-24 14:34:15 -04:00 |
|