Commit graph

1233 commits

Author SHA1 Message Date
Maximilian-Winter
c05fcdf42f Fixed none value of logits processors. 2023-05-24 22:02:06 +02:00
Maximilian-Winter
5bb780d455 Implemented logit processors and stop criteria's 2023-05-24 21:55:44 +02:00
Andrei Betlen
fab064ded9 Remove unnecessary ffi calls 2023-05-23 17:56:21 -04:00
Gary Mulder
ec44bdad61
Update README.md 2023-05-23 20:50:39 +01:00
Gary Mulder
ed19071ef8 Renamed and moved old Dockerfiles 2023-05-23 19:38:37 +00:00
Gary Mulder
70f629a72f
Update README.md 2023-05-23 20:36:21 +01:00
Gary Mulder
eaff7a8678 Initial commit of auto docker 2023-05-23 19:26:40 +00:00
Don Mahurin
d6a7adb17a fix "missing 1 required positional argument: 'min_keep'" 2023-05-23 06:42:22 -07:00
Don Mahurin
327eedbfe1 fix "from_bytes() missing required argument 'byteorder'" 2023-05-23 00:20:34 -07:00
Andrei Betlen
e5d596e0e9 Bump version 2023-05-22 23:50:58 -04:00
Andrei Betlen
c41b1ebca7 Update llama.cpp 2023-05-22 23:50:35 -04:00
Andrei
aa3d7a6299
Merge pull request #263 from abetlen/dependabot/pip/mkdocs-material-9.1.14
Bump mkdocs-material from 9.1.12 to 9.1.14
2023-05-22 23:44:51 -04:00
dependabot[bot]
2240b949ae
Bump mkdocs-material from 9.1.12 to 9.1.14
Bumps [mkdocs-material](https://github.com/squidfunk/mkdocs-material) from 9.1.12 to 9.1.14.
- [Release notes](https://github.com/squidfunk/mkdocs-material/releases)
- [Changelog](https://github.com/squidfunk/mkdocs-material/blob/master/CHANGELOG)
- [Commits](https://github.com/squidfunk/mkdocs-material/compare/9.1.12...9.1.14)

---
updated-dependencies:
- dependency-name: mkdocs-material
  dependency-type: direct:development
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-05-22 21:18:57 +00:00
Andrei
01c79e7bf1
Merge pull request #258 from Pipboyguy/main
Change docker build dynamic param to image instead of cuda version
2023-05-22 17:17:17 -04:00
Andrei
c3e80b1714
Merge pull request #262 from abetlen/dependabot/pip/httpx-0.24.1
Bump httpx from 0.24.0 to 0.24.1
2023-05-22 17:16:16 -04:00
dependabot[bot]
8e41d724ab
Bump httpx from 0.24.0 to 0.24.1
Bumps [httpx](https://github.com/encode/httpx) from 0.24.0 to 0.24.1.
- [Release notes](https://github.com/encode/httpx/releases)
- [Changelog](https://github.com/encode/httpx/blob/master/CHANGELOG.md)
- [Commits](https://github.com/encode/httpx/compare/0.24.0...0.24.1)

---
updated-dependencies:
- dependency-name: httpx
  dependency-type: direct:development
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-05-22 21:05:39 +00:00
Marcel Coetzee
e6639e6620 Change docker build dynamic param to image instead of cuda version
Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>
2023-05-22 10:10:14 +02:00
Andrei
4f7a6daa25
Merge pull request #248 from localagi/main
make git module accessible anonymously
2023-05-22 03:15:15 -04:00
Andrei Betlen
0adb9ec37a Use model_name and index in response 2023-05-21 21:30:03 -04:00
Andrei Betlen
922b5b2bfd Merge branch 'main' into server-embedding 2023-05-21 21:21:38 -04:00
Andrei Betlen
2c45255a0a Bump version 2023-05-21 19:24:20 -04:00
Andrei Betlen
cd102e9da1 Cache shared library function calls for static tokens 2023-05-21 19:18:56 -04:00
Andrei Betlen
b895511cca Fix penalize_nl 2023-05-21 18:38:06 -04:00
Andrei Betlen
03e2947b03 Fix unnecessary memory allocation while sampling 2023-05-21 18:36:34 -04:00
Andrei Betlen
fafe47114c Update llama.cpp 2023-05-21 17:47:21 -04:00
Andrei Betlen
8f49ca0287 Bump version 2023-05-20 08:53:40 -04:00
Andrei Betlen
76b1d2cd20 Change properties to functions to match token functions 2023-05-20 08:24:06 -04:00
Andrei Betlen
a7ba85834f Add n_ctx, n_vocab, and n_embd properties 2023-05-20 08:13:41 -04:00
Your Name
0b079a658c make git module accessible anonymously 2023-05-20 02:25:59 +01:00
Simon Chabot
e783f1c191 feat: make embedding support list of string as input
makes the /v1/embedding route similar to OpenAI api.
2023-05-20 01:23:32 +02:00
Andrei Betlen
01a010be52 Fix llama_cpp and Llama type signatures. Closes #221 2023-05-19 11:59:33 -04:00
Andrei Betlen
fb57b9470b Merge branch 'main' of github.com:abetlen/llama_cpp_python into main 2023-05-19 03:19:32 -04:00
Andrei Betlen
f82d85fbee Bump version 2023-05-19 03:19:27 -04:00
Andrei Betlen
c7788c85ab Add Guidance example 2023-05-19 03:16:58 -04:00
Andrei Betlen
a8cd169251 Bugfix: Stop sequences can be strings 2023-05-19 03:15:08 -04:00
Andrei Betlen
f0812c4d8c Add upgrade instructions to the README 2023-05-19 02:20:41 -04:00
Andrei Betlen
17d4271b04 Fix logprobs for completions and implement for streaming logprobs. 2023-05-19 02:20:27 -04:00
Andrei Betlen
a634a2453b Allow first logprob token to be null to match openai api 2023-05-19 02:04:57 -04:00
Andrei Betlen
dc39cc0fa4 Use server sent events function for streaming completion 2023-05-19 02:04:30 -04:00
Andrei
69f9d50090
Merge pull request #235 from Pipboyguy/main
Decrement CUDA version and bump Ubuntu
2023-05-18 13:42:04 -04:00
Andrei Betlen
f0ec6e615e Stream tokens instead of text chunks 2023-05-18 11:35:59 -04:00
Andrei Betlen
21d8f5fa9f Remove unnused union 2023-05-18 11:35:15 -04:00
Marcel Coetzee
6ece8a225a Set CUDA_VERSION as build ARG
Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>
2023-05-18 16:59:42 +02:00
Marcel Coetzee
6c57d38552 Decrement CUDA version and bump Ubuntu
Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>
2023-05-18 16:02:42 +02:00
Andrei Betlen
50e136252a Update llama.cpp 2023-05-17 16:14:12 -04:00
Andrei Betlen
db10e0078b Update docs 2023-05-17 16:14:01 -04:00
Andrei Betlen
61d58e7b35 Check for CUDA_PATH before adding 2023-05-17 15:26:38 -04:00
Andrei Betlen
7c95895626 Merge branch 'main' of github.com:abetlen/llama_cpp_python into main 2023-05-17 15:19:32 -04:00
Andrei
47921a312c
Merge pull request #225 from aneeshjoy/main
Fixed CUBLAS DLL load issues on Windows
2023-05-17 15:17:37 -04:00
Aneesh Joy
e9794f91f2
Fixd CUBLAS dll load issue in Windows 2023-05-17 18:04:58 +01:00