Commit graph

529 commits

Author SHA1 Message Date
Gary Mulder
cf4931a400 Working Open Llama 3B in a box 2023-06-02 08:48:54 +00:00
Gary Mulder
217d78320f Added paramterised search and d/l for Hugging Face. Updated README.md 2023-05-31 16:00:31 +00:00
Gary Mulder
483b6ba53a Updated README.md instructions on how to use *_simple/Dockerfiles 2023-05-31 15:16:32 +00:00
Gary Mulder
0e0c9bb978 Merge branch 'auto-docker' of github.com:gjmulder/llama-cpp-python-gary into auto-docker 2023-05-25 11:50:34 +00:00
Gary Mulder
0d2cc21202 Fixed repeated imports 2023-05-25 11:50:02 +00:00
Gary Mulder
ec44bdad61 Update README.md 2023-05-23 20:50:39 +01:00
Gary Mulder
ed19071ef8 Renamed and moved old Dockerfiles 2023-05-23 19:38:37 +00:00
Gary Mulder
70f629a72f Update README.md 2023-05-23 20:36:21 +01:00
Gary Mulder
eaff7a8678 Initial commit of auto docker 2023-05-23 19:26:40 +00:00
Andrei Betlen
e5d596e0e9 Bump version 2023-05-22 23:50:58 -04:00
Andrei Betlen
c41b1ebca7 Update llama.cpp 2023-05-22 23:50:35 -04:00
Andrei
aa3d7a6299 Merge pull request #263 from abetlen/dependabot/pip/mkdocs-material-9.1.14
Bump mkdocs-material from 9.1.12 to 9.1.14
2023-05-22 23:44:51 -04:00
dependabot[bot]
2240b949ae Bump mkdocs-material from 9.1.12 to 9.1.14
Bumps [mkdocs-material](https://github.com/squidfunk/mkdocs-material) from 9.1.12 to 9.1.14.
- [Release notes](https://github.com/squidfunk/mkdocs-material/releases)
- [Changelog](https://github.com/squidfunk/mkdocs-material/blob/master/CHANGELOG)
- [Commits](https://github.com/squidfunk/mkdocs-material/compare/9.1.12...9.1.14)

---
updated-dependencies:
- dependency-name: mkdocs-material
  dependency-type: direct:development
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-05-22 21:18:57 +00:00
Andrei
01c79e7bf1 Merge pull request #258 from Pipboyguy/main
Change docker build dynamic param to image instead of cuda version
2023-05-22 17:17:17 -04:00
Andrei
c3e80b1714 Merge pull request #262 from abetlen/dependabot/pip/httpx-0.24.1
Bump httpx from 0.24.0 to 0.24.1
2023-05-22 17:16:16 -04:00
dependabot[bot]
8e41d724ab Bump httpx from 0.24.0 to 0.24.1
Bumps [httpx](https://github.com/encode/httpx) from 0.24.0 to 0.24.1.
- [Release notes](https://github.com/encode/httpx/releases)
- [Changelog](https://github.com/encode/httpx/blob/master/CHANGELOG.md)
- [Commits](https://github.com/encode/httpx/compare/0.24.0...0.24.1)

---
updated-dependencies:
- dependency-name: httpx
  dependency-type: direct:development
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2023-05-22 21:05:39 +00:00
Marcel Coetzee
e6639e6620 Change docker build dynamic param to image instead of cuda version
Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>
2023-05-22 10:10:14 +02:00
Andrei
4f7a6daa25 Merge pull request #248 from localagi/main
make git module accessible anonymously
2023-05-22 03:15:15 -04:00
Andrei Betlen
0adb9ec37a Use model_name and index in response 2023-05-21 21:30:03 -04:00
Andrei Betlen
922b5b2bfd Merge branch 'main' into server-embedding 2023-05-21 21:21:38 -04:00
Andrei Betlen
2c45255a0a Bump version 2023-05-21 19:24:20 -04:00
Andrei Betlen
cd102e9da1 Cache shared library function calls for static tokens 2023-05-21 19:18:56 -04:00
Andrei Betlen
b895511cca Fix penalize_nl 2023-05-21 18:38:06 -04:00
Andrei Betlen
03e2947b03 Fix unnecessary memory allocation while sampling 2023-05-21 18:36:34 -04:00
Andrei Betlen
fafe47114c Update llama.cpp 2023-05-21 17:47:21 -04:00
Andrei Betlen
8f49ca0287 Bump version 2023-05-20 08:53:40 -04:00
Andrei Betlen
76b1d2cd20 Change properties to functions to match token functions 2023-05-20 08:24:06 -04:00
Andrei Betlen
a7ba85834f Add n_ctx, n_vocab, and n_embd properties 2023-05-20 08:13:41 -04:00
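The two entries above add model metadata accessors (n_ctx, n_vocab, n_embd) and then turn them from properties into methods so they are called the same way as the token functions. A minimal sketch of the resulting usage; the model path is a placeholder, not taken from the commits:

```python
from llama_cpp import Llama

# Placeholder path; point it at any local model file supported by llama.cpp.
llm = Llama(model_path="./models/example-model.bin")

# After 76b1d2cd20 these are plain methods, matching the token functions.
print("context size:", llm.n_ctx())
print("vocab size:", llm.n_vocab())
print("embedding size:", llm.n_embd())
```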
Your Name
0b079a658c make git module accessible anonymously 2023-05-20 02:25:59 +01:00
Simon Chabot
e783f1c191 feat: make embedding support list of string as input
makes the /v1/embedding route similar to OpenAI api.
2023-05-20 01:23:32 +02:00
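As the commit body notes, the embedding endpoint now accepts a list of strings, mirroring the OpenAI API. A minimal sketch against a locally running llama-cpp-python server; the host, port, OpenAI-style route name, and model name are assumptions rather than details from the commit:

```python
import requests

# Assumes the bundled server is running locally, e.g. via `python -m llama_cpp.server`.
response = requests.post(
    "http://localhost:8000/v1/embeddings",
    json={
        "model": "example-model",                        # placeholder model name
        "input": ["first sentence", "second sentence"],  # a list of strings, OpenAI-style
    },
)
for item in response.json()["data"]:
    print(len(item["embedding"]))
```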
Andrei Betlen
01a010be52 Fix llama_cpp and Llama type signatures. Closes #221 2023-05-19 11:59:33 -04:00
Andrei Betlen
fb57b9470b Merge branch 'main' of github.com:abetlen/llama_cpp_python into main 2023-05-19 03:19:32 -04:00
Andrei Betlen
f82d85fbee Bump version 2023-05-19 03:19:27 -04:00
Andrei Betlen
c7788c85ab Add Guidance example 2023-05-19 03:16:58 -04:00
Andrei Betlen
a8cd169251 Bugfix: Stop sequences can be strings 2023-05-19 03:15:08 -04:00
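The bugfix above concerns the stop parameter: per the message, a single string is accepted as well as a list of strings. A minimal sketch, assuming a local model file as a placeholder:

```python
from llama_cpp import Llama

llm = Llama(model_path="./models/example-model.bin")  # placeholder path

# After a8cd169251, both forms should work for stop sequences.
single = llm("Q: Name a planet.\nA:", stop="\n", max_tokens=16)           # one string
several = llm("Q: Name a planet.\nA:", stop=["\n", "Q:"], max_tokens=16)  # list of strings
print(single["choices"][0]["text"])
print(several["choices"][0]["text"])
```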
Andrei Betlen
f0812c4d8c Add upgrade instructions to the README 2023-05-19 02:20:41 -04:00
Andrei Betlen
17d4271b04 Fix logprobs for completions and implement for streaming logprobs. 2023-05-19 02:20:27 -04:00
Andrei Betlen
a634a2453b Allow first logprob token to be null to match openai api 2023-05-19 02:04:57 -04:00
Andrei Betlen
dc39cc0fa4 Use server sent events function for streaming completion 2023-05-19 02:04:30 -04:00
Andrei
69f9d50090 Merge pull request #235 from Pipboyguy/main
Decrement CUDA version and bump Ubuntu
2023-05-18 13:42:04 -04:00
Andrei Betlen
f0ec6e615e Stream tokens instead of text chunks 2023-05-18 11:35:59 -04:00
Andrei Betlen
21d8f5fa9f Remove unnused union 2023-05-18 11:35:15 -04:00
Marcel Coetzee
6ece8a225a Set CUDA_VERSION as build ARG
Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>
2023-05-18 16:59:42 +02:00
Marcel Coetzee
6c57d38552 Decrement CUDA version and bump Ubuntu
Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>
2023-05-18 16:02:42 +02:00
Andrei Betlen
50e136252a Update llama.cpp 2023-05-17 16:14:12 -04:00
Andrei Betlen
db10e0078b Update docs 2023-05-17 16:14:01 -04:00
Andrei Betlen
61d58e7b35 Check for CUDA_PATH before adding 2023-05-17 15:26:38 -04:00
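The commit above guards the Windows DLL search-path setup so CUDA_PATH is only used when it is actually set (see also the CUBLAS DLL load fix further down). A rough sketch of the pattern being described, not the exact code from the commit:

```python
import os
import sys

# cuBLAS-enabled builds on Windows need the CUDA runtime DLLs on the search path;
# only add the directory if CUDA_PATH is defined and the bin folder exists.
if sys.platform == "win32" and "CUDA_PATH" in os.environ:
    cuda_bin = os.path.join(os.environ["CUDA_PATH"], "bin")
    if os.path.exists(cuda_bin):
        os.add_dll_directory(cuda_bin)
```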
Andrei Betlen
7c95895626 Merge branch 'main' of github.com:abetlen/llama_cpp_python into main 2023-05-17 15:19:32 -04:00
Andrei
47921a312c Merge pull request #225 from aneeshjoy/main
Fixed CUBLAS DLL load issues on Windows
2023-05-17 15:17:37 -04:00
Aneesh Joy
e9794f91f2 Fixd CUBLAS dll load issue in Windows 2023-05-17 18:04:58 +01:00