Andrei Betlen
|
98bbd1c6a8
|
Fix eval logits type
|
2023-05-05 14:23:14 -04:00 |
|
Andrei Betlen
|
b5f3e74627
|
Add return type annotations for embeddings and logits
|
2023-05-05 14:22:55 -04:00 |
|
Andrei Betlen
|
3e28e0e50c
|
Fix: runtime type errors
|
2023-05-05 14:12:26 -04:00 |
|
Andrei Betlen
|
e24c3d7447
|
Prefer explicit imports
|
2023-05-05 14:05:31 -04:00 |
|
Andrei Betlen
|
40501435c1
|
Fix: types
|
2023-05-05 14:04:12 -04:00 |
|
Andrei Betlen
|
66e28eb548
|
Fix temperature bug
|
2023-05-05 14:00:41 -04:00 |
|
Andrei Betlen
|
6702d2abfd
|
Fix candidates type
|
2023-05-05 14:00:30 -04:00 |
|
Andrei Betlen
|
5e7ddfc3d6
|
Fix llama_cpp types
|
2023-05-05 13:54:22 -04:00 |
|
Andrei Betlen
|
b6a9a0b6ba
|
Add types for all low-level api functions
|
2023-05-05 12:22:27 -04:00 |
|
Andrei Betlen
|
5be0efa5f8
|
Cache should raise KeyError when key is missing
|
2023-05-05 12:21:49 -04:00 |
|
Andrei Betlen
|
24fc38754b
|
Add cli options to server. Closes #37
|
2023-05-05 12:08:28 -04:00 |
|
Andrei Betlen
|
5f583b0179
|
Merge branch 'main' of github.com:abetlen/llama_cpp_python into main
|
2023-05-04 21:59:40 -04:00 |
|
Andrei Betlen
|
5c165a85da
|
Bump version
|
2023-05-04 21:59:37 -04:00 |
|
Andrei Betlen
|
853dc711cc
|
Format
|
2023-05-04 21:58:36 -04:00 |
|
Andrei Betlen
|
97c6372350
|
Rewind model to longest prefix.
|
2023-05-04 21:58:27 -04:00 |
|
Andrei
|
38b8eeea58
|
Merge pull request #154 from th-neu/th-neu-dockerfile-slim
Slim-Bullseye based docker image
|
2023-05-04 19:59:23 -04:00 |
|
Thomas Neu
|
5672ed7fea
|
Merge branch 'abetlen:main' into th-neu-dockerfile-slim
|
2023-05-04 21:41:13 +02:00 |
|
Thomas Neu
|
501321875f
|
Slim-Bullseye based docker image
ends up at ~669MB
|
2023-05-04 21:03:19 +02:00 |
|
Andrei Betlen
|
cabd8b8ed1
|
Bump version
|
2023-05-04 12:21:20 -04:00 |
|
Andrei Betlen
|
d78cec67df
|
Update llama.cpp
|
2023-05-04 12:20:25 -04:00 |
|
Andrei Betlen
|
329297fafb
|
Bugfix: Missing logits_to_logprobs
|
2023-05-04 12:18:40 -04:00 |
|
Andrei Betlen
|
d594892fd4
|
Remove Docker CUDA build job
|
2023-05-04 00:02:46 -04:00 |
|
Andrei Betlen
|
0607f6578e
|
Use network installer for cuda
|
2023-05-03 23:22:16 -04:00 |
|
Andrei Betlen
|
6d3c20e39d
|
Add CUDA docker image build to github actions
|
2023-05-03 22:20:53 -04:00 |
|
Andrei Betlen
|
a02aa121da
|
Remove cuda build job
|
2023-05-03 10:50:48 -04:00 |
|
Andrei Betlen
|
07a56dd9c2
|
Update job name
|
2023-05-03 10:39:39 -04:00 |
|
Andrei Betlen
|
7839eb14d3
|
Add docker cuda image. Closes #143
|
2023-05-03 10:29:05 -04:00 |
|
Andrei Betlen
|
9e5b6d675a
|
Improve logging messages
|
2023-05-03 10:28:10 -04:00 |
|
Andrei Betlen
|
43f2907e3a
|
Support smaller state sizes
|
2023-05-03 09:33:50 -04:00 |
|
Andrei Betlen
|
1d47cce222
|
Update llama.cpp
|
2023-05-03 09:33:30 -04:00 |
|
Andrei Betlen
|
c2e31eecee
|
Update permissions
|
2023-05-02 01:23:17 -04:00 |
|
Andrei Betlen
|
63f8d3a6fb
|
Update context
|
2023-05-02 01:16:44 -04:00 |
|
Andrei Betlen
|
c21a34506e
|
Update permsissions
|
2023-05-02 01:13:43 -04:00 |
|
Andrei Betlen
|
872b2ec33f
|
Clone submodules
|
2023-05-02 01:11:34 -04:00 |
|
Andrei Betlen
|
62de4692f2
|
Fix missing dependency
|
2023-05-02 01:09:27 -04:00 |
|
Andrei
|
25062cecd3
|
Merge pull request #140 from abetlen/Niek/main
Add Dockerfile
|
2023-05-02 01:06:00 -04:00 |
|
Andrei Betlen
|
36c81489e7
|
Remove docker section of publish
|
2023-05-02 01:04:36 -04:00 |
|
Andrei Betlen
|
5d5421b29d
|
Add build docker
|
2023-05-02 01:04:02 -04:00 |
|
Andrei Betlen
|
81631afc48
|
Install from local directory
|
2023-05-02 00:55:51 -04:00 |
|
Andrei Betlen
|
d605408f99
|
Add dockerignore
|
2023-05-02 00:55:34 -04:00 |
|
Andrei
|
e644e75915
|
Merge pull request #139 from matthoffner/patch-1
Fix FTYPE typo
|
2023-05-02 00:33:45 -04:00 |
|
Matt Hoffner
|
f97ff3c5bb
|
Update llama_cpp.py
|
2023-05-01 20:40:06 -07:00 |
|
Andrei Betlen
|
e9e0654aed
|
Bump version
|
2023-05-01 22:52:25 -04:00 |
|
Andrei Betlen
|
46e3c4b84a
|
Fix
|
2023-05-01 22:41:54 -04:00 |
|
Andrei Betlen
|
9eafc4c49a
|
Refactor server to use factory
|
2023-05-01 22:38:46 -04:00 |
|
Andrei Betlen
|
dd9ad1c759
|
Formatting
|
2023-05-01 21:51:16 -04:00 |
|
Andrei Betlen
|
9d60ae56f2
|
Fix whitespace
|
2023-05-01 18:07:45 -04:00 |
|
Andrei Betlen
|
53c0129eb6
|
Update submoduele clone instructions
|
2023-05-01 18:07:15 -04:00 |
|
Andrei Betlen
|
b6747f722e
|
Fix logprob calculation. Fixes #134
|
2023-05-01 17:45:08 -04:00 |
|
Andrei Betlen
|
c088a2b3a7
|
Un-skip tests
|
2023-05-01 15:46:03 -04:00 |
|