baalajimaestro/llama.cpp

Author	SHA1	Message	Date
Andrei	01c79e7bf1	Merge pull request #258 from Pipboyguy/main Change docker build dynamic param to image instead of cuda version	2023-05-22 17:17:17 -04:00
Andrei	c3e80b1714	Merge pull request #262 from abetlen/dependabot/pip/httpx-0.24.1 Bump httpx from 0.24.0 to 0.24.1	2023-05-22 17:16:16 -04:00
dependabot[bot]	8e41d724ab	Bump httpx from 0.24.0 to 0.24.1 Bumps [httpx](https://github.com/encode/httpx) from 0.24.0 to 0.24.1. - [Release notes](https://github.com/encode/httpx/releases) - [Changelog](https://github.com/encode/httpx/blob/master/CHANGELOG.md) - [Commits](https://github.com/encode/httpx/compare/0.24.0...0.24.1) --- updated-dependencies: - dependency-name: httpx dependency-type: direct:development update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com>	2023-05-22 21:05:39 +00:00
Marcel Coetzee	e6639e6620	Change docker build dynamic param to image instead of cuda version Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>	2023-05-22 10:10:14 +02:00
Andrei	4f7a6daa25	Merge pull request #248 from localagi/main make git module accessible anonymously	2023-05-22 03:15:15 -04:00
Andrei Betlen	0adb9ec37a	Use model_name and index in response	2023-05-21 21:30:03 -04:00
Andrei Betlen	922b5b2bfd	Merge branch 'main' into server-embedding	2023-05-21 21:21:38 -04:00
Andrei Betlen	2c45255a0a	Bump version	2023-05-21 19:24:20 -04:00
Andrei Betlen	cd102e9da1	Cache shared library function calls for static tokens	2023-05-21 19:18:56 -04:00
Andrei Betlen	b895511cca	Fix penalize_nl	2023-05-21 18:38:06 -04:00
Andrei Betlen	03e2947b03	Fix unnecessary memory allocation while sampling	2023-05-21 18:36:34 -04:00
Andrei Betlen	fafe47114c	Update llama.cpp	2023-05-21 17:47:21 -04:00
Andrei Betlen	8f49ca0287	Bump version	2023-05-20 08:53:40 -04:00
Andrei Betlen	76b1d2cd20	Change properties to functions to match token functions	2023-05-20 08:24:06 -04:00
Andrei Betlen	a7ba85834f	Add n_ctx, n_vocab, and n_embd properties	2023-05-20 08:13:41 -04:00
Your Name	0b079a658c	make git module accessible anonymously	2023-05-20 02:25:59 +01:00
Simon Chabot	e783f1c191	feat: make embedding support list of string as input makes the /v1/embedding route similar to OpenAI api.	2023-05-20 01:23:32 +02:00
Andrei Betlen	01a010be52	Fix llama_cpp and Llama type signatures. Closes #221	2023-05-19 11:59:33 -04:00
Andrei Betlen	fb57b9470b	Merge branch 'main' of github.com:abetlen/llama_cpp_python into main	2023-05-19 03:19:32 -04:00
Andrei Betlen	f82d85fbee	Bump version	2023-05-19 03:19:27 -04:00
Andrei Betlen	c7788c85ab	Add Guidance example	2023-05-19 03:16:58 -04:00
Andrei Betlen	a8cd169251	Bugfix: Stop sequences can be strings	2023-05-19 03:15:08 -04:00
Andrei Betlen	f0812c4d8c	Add upgrade instructions to the README	2023-05-19 02:20:41 -04:00
Andrei Betlen	17d4271b04	Fix logprobs for completions and implement for streaming logprobs.	2023-05-19 02:20:27 -04:00
Andrei Betlen	a634a2453b	Allow first logprob token to be null to match openai api	2023-05-19 02:04:57 -04:00
Andrei Betlen	dc39cc0fa4	Use server sent events function for streaming completion	2023-05-19 02:04:30 -04:00
Andrei	69f9d50090	Merge pull request #235 from Pipboyguy/main Decrement CUDA version and bump Ubuntu	2023-05-18 13:42:04 -04:00
Andrei Betlen	f0ec6e615e	Stream tokens instead of text chunks	2023-05-18 11:35:59 -04:00
Andrei Betlen	21d8f5fa9f	Remove unnused union	2023-05-18 11:35:15 -04:00
Marcel Coetzee	6ece8a225a	Set CUDA_VERSION as build ARG Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>	2023-05-18 16:59:42 +02:00
Marcel Coetzee	6c57d38552	Decrement CUDA version and bump Ubuntu Signed-off-by: Marcel Coetzee <marcel@mooncoon.com>	2023-05-18 16:02:42 +02:00
Andrei Betlen	50e136252a	Update llama.cpp	2023-05-17 16:14:12 -04:00
Andrei Betlen	db10e0078b	Update docs	2023-05-17 16:14:01 -04:00
Andrei Betlen	61d58e7b35	Check for CUDA_PATH before adding	2023-05-17 15:26:38 -04:00
Andrei Betlen	7c95895626	Merge branch 'main' of github.com:abetlen/llama_cpp_python into main	2023-05-17 15:19:32 -04:00
Andrei	47921a312c	Merge pull request #225 from aneeshjoy/main Fixed CUBLAS DLL load issues on Windows	2023-05-17 15:17:37 -04:00
Aneesh Joy	e9794f91f2	Fixd CUBLAS dll load issue in Windows	2023-05-17 18:04:58 +01:00
Andrei Betlen	70695c430b	Move docs link up	2023-05-17 11:40:12 -04:00
Andrei Betlen	4f342795e5	Update token checks	2023-05-17 03:35:13 -04:00
Andrei Betlen	626003c884	Merge branch 'main' of github.com:abetlen/llama_cpp_python into main	2023-05-17 02:00:48 -04:00
Andrei Betlen	f5c2f998ab	Format	2023-05-17 02:00:39 -04:00
Andrei Betlen	d28b753ed2	Implement penalize_nl	2023-05-17 01:53:26 -04:00
Andrei Betlen	f11e2a781c	Fix last_n_tokens_size	2023-05-17 01:42:51 -04:00
Andrei Betlen	7e55244540	Fix top_k value. Closes #220	2023-05-17 01:41:42 -04:00
Andrei Betlen	e37a808bc0	Update llama.cpp	2023-05-16 23:33:53 -04:00
Andrei Betlen	a7c9e38287	Update variable name	2023-05-16 18:07:25 -04:00
Andrei Betlen	a3352923c7	Add model_alias option to override model_path in completions. Closes #39	2023-05-16 17:22:00 -04:00
Andrei Betlen	214589e462	Update llama.cpp	2023-05-16 17:20:45 -04:00
Andrei Betlen	a65125c0bd	Add sampling defaults for generate	2023-05-16 09:35:50 -04:00
Andrei Betlen	341c50b5b0	Fix CMakeLists.txt	2023-05-16 09:07:14 -04:00

516 commits