Gary Mulder
|
70f629a72f
|
Update README.md
|
2023-05-23 20:36:21 +01:00 |
|
Gary Mulder
|
eaff7a8678
|
Initial commit of auto docker
|
2023-05-23 19:26:40 +00:00 |
|
Andrei Betlen
|
e5d596e0e9
|
Bump version
|
2023-05-22 23:50:58 -04:00 |
|
Andrei Betlen
|
c41b1ebca7
|
Update llama.cpp
|
2023-05-22 23:50:35 -04:00 |
|
Andrei
|
aa3d7a6299
|
Merge pull request #263 from abetlen/dependabot/pip/mkdocs-material-9.1.14
Bump mkdocs-material from 9.1.12 to 9.1.14
|
2023-05-22 23:44:51 -04:00 |
|
dependabot[bot]
|
2240b949ae
|
Bump mkdocs-material from 9.1.12 to 9.1.14
Bumps [mkdocs-material](https://github.com/squidfunk/mkdocs-material) from 9.1.12 to 9.1.14.
- [Release notes](https://github.com/squidfunk/mkdocs-material/releases)
- [Changelog](https://github.com/squidfunk/mkdocs-material/blob/master/CHANGELOG)
- [Commits](https://github.com/squidfunk/mkdocs-material/compare/9.1.12...9.1.14)
---
updated-dependencies:
- dependency-name: mkdocs-material
  dependency-type: direct:development
  update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
|
2023-05-22 21:18:57 +00:00 |
|
Andrei
|
01c79e7bf1
|
Merge pull request #258 from Pipboyguy/main
Change docker build dynamic param to image instead of cuda version
|
2023-05-22 17:17:17 -04:00 |
|
Andrei
|
c3e80b1714
|
Merge pull request #262 from abetlen/dependabot/pip/httpx-0.24.1
Bump httpx from 0.24.0 to 0.24.1
|
2023-05-22 17:16:16 -04:00 |
|
dependabot[bot]
|
8e41d724ab
|
Bump httpx from 0.24.0 to 0.24.1
Bumps [httpx](https://github.com/encode/httpx) from 0.24.0 to 0.24.1.
- [Release notes](https://github.com/encode/httpx/releases)
- [Changelog](https://github.com/encode/httpx/blob/master/CHANGELOG.md)
- [Commits](https://github.com/encode/httpx/compare/0.24.0...0.24.1)
---
updated-dependencies:
- dependency-name: httpx
  dependency-type: direct:development
  update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
|
2023-05-22 21:05:39 +00:00 |
|
Marcel Coetzee
|
e6639e6620
|
Change docker build dynamic param to image instead of cuda version
Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>
|
2023-05-22 10:10:14 +02:00 |
|
Andrei
|
4f7a6daa25
|
Merge pull request #248 from localagi/main
make git module accessible anonymously
|
2023-05-22 03:15:15 -04:00 |
|
Andrei Betlen
|
0adb9ec37a
|
Use model_name and index in response
|
2023-05-21 21:30:03 -04:00 |
|
Andrei Betlen
|
922b5b2bfd
|
Merge branch 'main' into server-embedding
|
2023-05-21 21:21:38 -04:00 |
|
Andrei Betlen
|
2c45255a0a
|
Bump version
|
2023-05-21 19:24:20 -04:00 |
|
Andrei Betlen
|
cd102e9da1
|
Cache shared library function calls for static tokens
|
2023-05-21 19:18:56 -04:00 |
|
Andrei Betlen
|
b895511cca
|
Fix penalize_nl
|
2023-05-21 18:38:06 -04:00 |
|
Andrei Betlen
|
03e2947b03
|
Fix unnecessary memory allocation while sampling
|
2023-05-21 18:36:34 -04:00 |
|
Andrei Betlen
|
fafe47114c
|
Update llama.cpp
|
2023-05-21 17:47:21 -04:00 |
|
Andrei Betlen
|
8f49ca0287
|
Bump version
|
2023-05-20 08:53:40 -04:00 |
|
Andrei Betlen
|
76b1d2cd20
|
Change properties to functions to match token functions
|
2023-05-20 08:24:06 -04:00 |
|
Andrei Betlen
|
a7ba85834f
|
Add n_ctx, n_vocab, and n_embd properties
|
2023-05-20 08:13:41 -04:00 |
|
Your Name
|
0b079a658c
|
make git module accessible anonymously
|
2023-05-20 02:25:59 +01:00 |
|
Simon Chabot
|
e783f1c191
|
feat: make embedding support list of string as input
makes the /v1/embedding route similar to OpenAI api.
|
2023-05-20 01:23:32 +02:00 |
|
Andrei Betlen
|
01a010be52
|
Fix llama_cpp and Llama type signatures. Closes #221
|
2023-05-19 11:59:33 -04:00 |
|
Andrei Betlen
|
fb57b9470b
|
Merge branch 'main' of github.com:abetlen/llama_cpp_python into main
|
2023-05-19 03:19:32 -04:00 |
|
Andrei Betlen
|
f82d85fbee
|
Bump version
|
2023-05-19 03:19:27 -04:00 |
|
Andrei Betlen
|
c7788c85ab
|
Add Guidance example
|
2023-05-19 03:16:58 -04:00 |
|
Andrei Betlen
|
a8cd169251
|
Bugfix: Stop sequences can be strings
|
2023-05-19 03:15:08 -04:00 |
|
Andrei Betlen
|
f0812c4d8c
|
Add upgrade instructions to the README
|
2023-05-19 02:20:41 -04:00 |
|
Andrei Betlen
|
17d4271b04
|
Fix logprobs for completions and implement for streaming logprobs.
|
2023-05-19 02:20:27 -04:00 |
|
Andrei Betlen
|
a634a2453b
|
Allow first logprob token to be null to match openai api
|
2023-05-19 02:04:57 -04:00 |
|
Andrei Betlen
|
dc39cc0fa4
|
Use server sent events function for streaming completion
|
2023-05-19 02:04:30 -04:00 |
|
Andrei
|
69f9d50090
|
Merge pull request #235 from Pipboyguy/main
Decrement CUDA version and bump Ubuntu
|
2023-05-18 13:42:04 -04:00 |
|
Andrei Betlen
|
f0ec6e615e
|
Stream tokens instead of text chunks
|
2023-05-18 11:35:59 -04:00 |
|
Andrei Betlen
|
21d8f5fa9f
|
Remove unused union
|
2023-05-18 11:35:15 -04:00 |
|
Marcel Coetzee
|
6ece8a225a
|
Set CUDA_VERSION as build ARG
Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>
|
2023-05-18 16:59:42 +02:00 |
|
Marcel Coetzee
|
6c57d38552
|
Decrement CUDA version and bump Ubuntu
Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>
|
2023-05-18 16:02:42 +02:00 |
|
Andrei Betlen
|
50e136252a
|
Update llama.cpp
|
2023-05-17 16:14:12 -04:00 |
|
Andrei Betlen
|
db10e0078b
|
Update docs
|
2023-05-17 16:14:01 -04:00 |
|
Andrei Betlen
|
61d58e7b35
|
Check for CUDA_PATH before adding
|
2023-05-17 15:26:38 -04:00 |
|
Andrei Betlen
|
7c95895626
|
Merge branch 'main' of github.com:abetlen/llama_cpp_python into main
|
2023-05-17 15:19:32 -04:00 |
|
Andrei
|
47921a312c
|
Merge pull request #225 from aneeshjoy/main
Fixed CUBLAS DLL load issues on Windows
|
2023-05-17 15:17:37 -04:00 |
|
Aneesh Joy
|
e9794f91f2
|
Fixed CUBLAS dll load issue in Windows
|
2023-05-17 18:04:58 +01:00 |
|
Andrei Betlen
|
70695c430b
|
Move docs link up
|
2023-05-17 11:40:12 -04:00 |
|
Andrei Betlen
|
4f342795e5
|
Update token checks
|
2023-05-17 03:35:13 -04:00 |
|
Andrei Betlen
|
626003c884
|
Merge branch 'main' of github.com:abetlen/llama_cpp_python into main
|
2023-05-17 02:00:48 -04:00 |
|
Andrei Betlen
|
f5c2f998ab
|
Format
|
2023-05-17 02:00:39 -04:00 |
|
Andrei Betlen
|
d28b753ed2
|
Implement penalize_nl
|
2023-05-17 01:53:26 -04:00 |
|
Andrei Betlen
|
f11e2a781c
|
Fix last_n_tokens_size
|
2023-05-17 01:42:51 -04:00 |
|
Andrei Betlen
|
7e55244540
|
Fix top_k value. Closes #220
|
2023-05-17 01:41:42 -04:00 |
|