Commit graph

745 commits

Author SHA1 Message Date
Gary Mulder
cf4931a400 Working Open Llama 3B in a box 2023-06-02 08:48:54 +00:00
Andrei Betlen
71f4582d44 Bump version 2023-05-31 23:25:39 -04:00
Andrei Betlen
a89e3ca35f Merge branch 'main' of github.com:abetlen/llama_cpp_python into main 2023-05-31 23:25:06 -04:00
Andrei Betlen
5377f9784a Update llama.cpp 2023-05-31 23:24:52 -04:00
Maximilian-Winter
1848afebe0 Added dependencies. 2023-05-31 22:41:35 +02:00
Maximilian-Winter
29f9c9cca3 Added both LlamaChache classes Disk and RAM. 2023-05-31 22:33:56 +02:00
Gary Mulder
217d78320f Added paramterised search and d/l for Hugging Face. Updated README.md 2023-05-31 16:00:31 +00:00
Gary Mulder
483b6ba53a Updated README.md instructions on how to use *_simple/Dockerfiles 2023-05-31 15:16:32 +00:00
Maximilian Winter
9ea7a379d3
Merge branch 'abetlen:main' into main 2023-05-31 12:55:51 +02:00
Andrei
232880cbc6
Merge pull request #297 from gjmulder/update-issue-tmpl-llamacpp-repro
Update bug_report.md
2023-05-30 14:48:14 -04:00
Gary Mulder
9dd8cf3472
Update bug_report.md
- Added section on how to repro using llama.cpp in ./vendor/llama.cpp
- Added a few more example environment commands to aid in debugging.
2023-05-30 08:20:34 +01:00
Andrei Betlen
cb0bcdbbb7 Bump version 2023-05-30 03:07:36 -04:00
Andrei Betlen
f4ff8a03c4 Add numpy dependency to pyproject 2023-05-30 03:06:57 -04:00
Andrei Betlen
828f9ec015 Merge branch 'main' of github.com:abetlen/llama_cpp_python into main 2023-05-29 21:39:40 -04:00
Andrei Betlen
b1daf568e3 Update changelog 2023-05-29 21:39:19 -04:00
Andrei
825912a3a5
Merge pull request #293 from abetlen/dependabot/pip/typing-extensions-4.6.2
Bump typing-extensions from 4.5.0 to 4.6.2
2023-05-29 21:28:27 -04:00
dependabot[bot]
f4fc126a00
Bump typing-extensions from 4.5.0 to 4.6.2
Bumps [typing-extensions](https://github.com/python/typing_extensions) from 4.5.0 to 4.6.2.
- [Changelog](https://github.com/python/typing_extensions/blob/main/CHANGELOG.md)
- [Commits](https://github.com/python/typing_extensions/compare/4.5.0...4.6.2)

---
updated-dependencies:
- dependency-name: typing-extensions
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-05-30 01:09:10 +00:00
Andrei
213549787a
Merge pull request #294 from abetlen/dependabot/pip/uvicorn-0.22.0
Bump uvicorn from 0.21.1 to 0.22.0
2023-05-29 21:07:41 -04:00
dependabot[bot]
8dfb0816df
Bump uvicorn from 0.21.1 to 0.22.0
Bumps [uvicorn](https://github.com/encode/uvicorn) from 0.21.1 to 0.22.0.
- [Release notes](https://github.com/encode/uvicorn/releases)
- [Changelog](https://github.com/encode/uvicorn/blob/master/CHANGELOG.md)
- [Commits](https://github.com/encode/uvicorn/compare/0.21.1...0.22.0)

---
updated-dependencies:
- dependency-name: uvicorn
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-05-30 01:04:25 +00:00
Andrei
f5d136d0aa
Merge pull request #295 from abetlen/dependabot/pip/mkdocs-material-9.1.15
Bump mkdocs-material from 9.1.14 to 9.1.15
2023-05-29 21:02:27 -04:00
Andrei
49fe9395a1
Merge pull request #277 from abetlen/add-numpy-support
Use numpy for internal buffers
2023-05-29 20:59:30 -04:00
dependabot[bot]
38b918503f
Bump mkdocs-material from 9.1.14 to 9.1.15
Bumps [mkdocs-material](https://github.com/squidfunk/mkdocs-material) from 9.1.14 to 9.1.15.
- [Release notes](https://github.com/squidfunk/mkdocs-material/releases)
- [Changelog](https://github.com/squidfunk/mkdocs-material/blob/master/CHANGELOG)
- [Commits](https://github.com/squidfunk/mkdocs-material/compare/9.1.14...9.1.15)

---
updated-dependencies:
- dependency-name: mkdocs-material
  dependency-type: direct:development
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-05-30 00:40:52 +00:00
Andrei
b61b016585
Merge pull request #296 from abetlen/dependabot/pip/scikit-build-0.17.5
Bump scikit-build from 0.13.0 to 0.17.5
2023-05-29 20:39:52 -04:00
dependabot[bot]
fa79484a29
Bump scikit-build from 0.13.0 to 0.17.5
Bumps [scikit-build](https://github.com/scikit-build/scikit-build) from 0.13.0 to 0.17.5.
- [Release notes](https://github.com/scikit-build/scikit-build/releases)
- [Changelog](https://github.com/scikit-build/scikit-build/blob/main/CHANGES.rst)
- [Commits](https://github.com/scikit-build/scikit-build/compare/0.13.0...0.17.5)

---
updated-dependencies:
- dependency-name: scikit-build
  dependency-type: direct:development
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-05-30 00:29:33 +00:00
Andrei
41e2184431
Merge pull request #292 from abetlen/dependabot/pip/mkdocstrings-0.22.0
Bump mkdocstrings from 0.21.2 to 0.22.0
2023-05-29 20:28:02 -04:00
dependabot[bot]
62ac7c3761
Bump mkdocstrings from 0.21.2 to 0.22.0
Bumps [mkdocstrings](https://github.com/mkdocstrings/mkdocstrings) from 0.21.2 to 0.22.0.
- [Release notes](https://github.com/mkdocstrings/mkdocstrings/releases)
- [Changelog](https://github.com/mkdocstrings/mkdocstrings/blob/master/CHANGELOG.md)
- [Commits](https://github.com/mkdocstrings/mkdocstrings/compare/0.21.2...0.22.0)

---
updated-dependencies:
- dependency-name: mkdocstrings
  dependency-type: direct:development
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-05-29 21:03:33 +00:00
Maximilian-Winter
719c3eae0a Diskcache implementation for llama state. 2023-05-28 15:56:38 +02:00
Andrei Betlen
80066f0b80 Use async routes 2023-05-27 09:12:58 -04:00
Andrei Betlen
c2b59a5f59 Import unnused import 2023-05-26 22:59:29 -04:00
Andrei Betlen
8f2b4456ad Format 2023-05-26 22:04:31 -04:00
Andrei Betlen
84e313bd6e Align dtype to match c structs 2023-05-26 22:02:16 -04:00
Andrei Betlen
b0b154cfa6 Add changelog message for numpy 2023-05-26 20:26:08 -04:00
Andrei Betlen
66bcb8d70d Merge branch 'main' into add-numpy-support 2023-05-26 20:25:03 -04:00
Andrei Betlen
8f35bddd7e Fix stop sequence performance bug. 2023-05-26 20:23:49 -04:00
Andrei Betlen
7fc7bc30e7 Remove usage of eval_tokens for cache check 2023-05-26 20:12:05 -04:00
Andrei Betlen
fe331ec589 Replace eval_logits and eval_tokens with numpy arrays 2023-05-26 20:03:31 -04:00
Andrei Betlen
00ea3af51b Add makefile 2023-05-26 17:56:20 -04:00
Andrei Betlen
447a3d249e Merge branch 'main' into setup 2023-05-26 17:53:58 -04:00
Andrei Betlen
efb763bcdc Merge branch 'main' into add-numpy-support 2023-05-26 17:37:33 -04:00
Andrei Betlen
030fafe901 Add project changelog 2023-05-26 17:32:34 -04:00
Andrei Betlen
878c7d363f Merge branch 'main' into add-numpy-support 2023-05-26 17:22:29 -04:00
Andrei Betlen
6075e17cb6 Bump version 2023-05-26 17:21:51 -04:00
Andrei Betlen
bd4b95da45 Reduce numpy version dependency 2023-05-26 16:38:21 -04:00
Andrei Betlen
8eb9769f78 Add support for numpy 2023-05-26 16:12:45 -04:00
Andrei
2adf6f3f9a
Merge pull request #265 from dmahurin/fix-from-bytes-byteorder
fix "from_bytes() missing required argument 'byteorder'"
2023-05-26 12:53:06 -04:00
Andrei
34ad71f448
Merge pull request #274 from dmahurin/fix-missing-antiprompt
low_level_api_chat_cpp.py: Fix missing antiprompt output in chat.
2023-05-26 12:52:34 -04:00
Andrei
d78453c045
Merge pull request #264 from dmahurin/fix-min-keep
fix "missing 1 required positional argument: 'min_keep'"
2023-05-26 12:52:05 -04:00
Andrei Betlen
4c1b7f7a76 Bugfix for logits_processor and stopping_criteria 2023-05-26 10:25:28 -04:00
Don Mahurin
0fa2ec4903 low_level_api_chat_cpp.py: Fix missing antiprompt output in chat. 2023-05-26 06:54:28 -07:00
Andrei Betlen
433a2e3e8a Add extra logits_processor and stopping_criteria 2023-05-26 03:13:24 -04:00