baalajimaestro/llama.cpp

Author	SHA1	Message	Date
Andrei Betlen	98bbd1c6a8	Fix eval logits type	2023-05-05 14:23:14 -04:00
Andrei Betlen	b5f3e74627	Add return type annotations for embeddings and logits	2023-05-05 14:22:55 -04:00
Andrei Betlen	3e28e0e50c	Fix: runtime type errors	2023-05-05 14:12:26 -04:00
Andrei Betlen	e24c3d7447	Prefer explicit imports	2023-05-05 14:05:31 -04:00
Andrei Betlen	40501435c1	Fix: types	2023-05-05 14:04:12 -04:00
Andrei Betlen	66e28eb548	Fix temperature bug	2023-05-05 14:00:41 -04:00
Andrei Betlen	6702d2abfd	Fix candidates type	2023-05-05 14:00:30 -04:00
Andrei Betlen	5e7ddfc3d6	Fix llama_cpp types	2023-05-05 13:54:22 -04:00
Andrei Betlen	b6a9a0b6ba	Add types for all low-level api functions	2023-05-05 12:22:27 -04:00
Andrei Betlen	5be0efa5f8	Cache should raise KeyError when key is missing	2023-05-05 12:21:49 -04:00
Andrei Betlen	24fc38754b	Add cli options to server. Closes #37	2023-05-05 12:08:28 -04:00
Andrei Betlen	5f583b0179	Merge branch 'main' of github.com:abetlen/llama_cpp_python into main	2023-05-04 21:59:40 -04:00
Andrei Betlen	5c165a85da	Bump version	2023-05-04 21:59:37 -04:00
Andrei Betlen	853dc711cc	Format	2023-05-04 21:58:36 -04:00
Andrei Betlen	97c6372350	Rewind model to longest prefix.	2023-05-04 21:58:27 -04:00
Andrei	38b8eeea58	Merge pull request #154 from th-neu/th-neu-dockerfile-slim Slim-Bullseye based docker image	2023-05-04 19:59:23 -04:00
Thomas Neu	5672ed7fea	Merge branch 'abetlen:main' into th-neu-dockerfile-slim	2023-05-04 21:41:13 +02:00
Thomas Neu	501321875f	Slim-Bullseye based docker image ends up at ~669MB	2023-05-04 21:03:19 +02:00
Andrei Betlen	cabd8b8ed1	Bump version	2023-05-04 12:21:20 -04:00
Andrei Betlen	d78cec67df	Update llama.cpp	2023-05-04 12:20:25 -04:00
Andrei Betlen	329297fafb	Bugfix: Missing logits_to_logprobs	2023-05-04 12:18:40 -04:00
Andrei Betlen	d594892fd4	Remove Docker CUDA build job	2023-05-04 00:02:46 -04:00
Andrei Betlen	0607f6578e	Use network installer for cuda	2023-05-03 23:22:16 -04:00
Andrei Betlen	6d3c20e39d	Add CUDA docker image build to github actions	2023-05-03 22:20:53 -04:00
Andrei Betlen	a02aa121da	Remove cuda build job	2023-05-03 10:50:48 -04:00
Andrei Betlen	07a56dd9c2	Update job name	2023-05-03 10:39:39 -04:00
Andrei Betlen	7839eb14d3	Add docker cuda image. Closes #143	2023-05-03 10:29:05 -04:00
Andrei Betlen	9e5b6d675a	Improve logging messages	2023-05-03 10:28:10 -04:00
Andrei Betlen	43f2907e3a	Support smaller state sizes	2023-05-03 09:33:50 -04:00
Andrei Betlen	1d47cce222	Update llama.cpp	2023-05-03 09:33:30 -04:00
Andrei Betlen	c2e31eecee	Update permissions	2023-05-02 01:23:17 -04:00
Andrei Betlen	63f8d3a6fb	Update context	2023-05-02 01:16:44 -04:00
Andrei Betlen	c21a34506e	Update permsissions	2023-05-02 01:13:43 -04:00
Andrei Betlen	872b2ec33f	Clone submodules	2023-05-02 01:11:34 -04:00
Andrei Betlen	62de4692f2	Fix missing dependency	2023-05-02 01:09:27 -04:00
Andrei	25062cecd3	Merge pull request #140 from abetlen/Niek/main Add Dockerfile	2023-05-02 01:06:00 -04:00
Andrei Betlen	36c81489e7	Remove docker section of publish	2023-05-02 01:04:36 -04:00
Andrei Betlen	5d5421b29d	Add build docker	2023-05-02 01:04:02 -04:00
Andrei Betlen	81631afc48	Install from local directory	2023-05-02 00:55:51 -04:00
Andrei Betlen	d605408f99	Add dockerignore	2023-05-02 00:55:34 -04:00
Andrei	e644e75915	Merge pull request #139 from matthoffner/patch-1 Fix FTYPE typo	2023-05-02 00:33:45 -04:00
Matt Hoffner	f97ff3c5bb	Update llama_cpp.py	2023-05-01 20:40:06 -07:00
Andrei Betlen	e9e0654aed	Bump version	2023-05-01 22:52:25 -04:00
Andrei Betlen	46e3c4b84a	Fix	2023-05-01 22:41:54 -04:00
Andrei Betlen	9eafc4c49a	Refactor server to use factory	2023-05-01 22:38:46 -04:00
Andrei Betlen	dd9ad1c759	Formatting	2023-05-01 21:51:16 -04:00
Andrei Betlen	9d60ae56f2	Fix whitespace	2023-05-01 18:07:45 -04:00
Andrei Betlen	53c0129eb6	Update submoduele clone instructions	2023-05-01 18:07:15 -04:00
Andrei Betlen	b6747f722e	Fix logprob calculation. Fixes #134	2023-05-01 17:45:08 -04:00
Andrei Betlen	c088a2b3a7	Un-skip tests	2023-05-01 15:46:03 -04:00

362 commits