Commit graph

3035 commits

Author SHA1 Message Date
Jeffrey Morgan
92578798bb fix relative links in README.md 2023-10-11 19:24:06 -04:00
Michael Yang
788637918a
Merge pull request #760 from jmorganca/mxyng/more-downloads
Mxyng/more downloads
2023-10-11 14:33:10 -07:00
Michael Yang
c413a55093 download: handle inner errors 2023-10-11 14:15:30 -07:00
Michael Yang
630bb75d2a dynamically size download parts based on file size 2023-10-11 14:10:25 -07:00
Michael Yang
a2055a1e93 update download 2023-10-11 14:10:25 -07:00
Michael Yang
b599946b74 add format bytes 2023-10-11 14:08:23 -07:00
Michael Yang
aca2d65b82
Merge pull request #757 from jmorganca/mxyng/format-time
cleanup format time
2023-10-11 11:12:29 -07:00
Michael Yang
b5e08e3373 cleanup format time 2023-10-11 11:09:27 -07:00
Bruce MacDonald
274d5a5fdf
optional parameter to not stream response (#639)
* update streaming request accept header
* add optional stream param to request bodies
2023-10-11 12:54:27 -04:00
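
A minimal sketch of the optional stream parameter described in the commit above: setting "stream": false asks the server for a single JSON response instead of a stream of chunks. The endpoint, model name, and field names here are assumptions based on the commit message, not code copied from the repository.

```go
package main

import (
	"bytes"
	"fmt"
	"io"
	"net/http"
)

func main() {
	// "stream": false requests a single, non-streamed response body.
	body := []byte(`{"model": "llama2", "prompt": "why is the sky blue?", "stream": false}`)
	resp, err := http.Post("http://localhost:11434/api/generate", "application/json", bytes.NewReader(body))
	if err != nil {
		fmt.Println("request failed:", err)
		return
	}
	defer resp.Body.Close()
	out, _ := io.ReadAll(resp.Body)
	fmt.Println(string(out))
}
```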
Matt Williams
fc6b49be32 add ts alternate to python langchain simplegen
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-11 09:50:15 -07:00
Bruce MacDonald
77295f716e
prevent waiting on exited command (#752)
* prevent waiting on exited command
* close llama runner once
2023-10-11 12:32:13 -04:00
Matt Williams
615f7d1dea cleanup readme.
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-11 06:13:29 -07:00
Matt Williams
cdf5e106ae rename dirs
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-11 06:10:24 -07:00
Matt Williams
a85329f59a rename the models to be more descriptive
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-10 17:40:02 -07:00
Bruce MacDonald
f2ba1311aa
improve vram safety with 5% vram memory buffer (#724)
* check free memory not total
* wait for subprocess to exit
2023-10-10 16:16:09 -04:00
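
A rough illustration of the idea in the commit above, reserving a 5% buffer of free (not total) VRAM before deciding how much can be offloaded to the GPU. The per-layer size and free-VRAM figure are assumptions for the sketch, not values from the repository.

```go
package main

import "fmt"

// bytesPerLayer is a hypothetical per-layer memory estimate (~400 MiB).
const bytesPerLayer = 400 << 20

func main() {
	freeVRAM := int64(8 << 30)       // free VRAM reported by the driver, not total VRAM
	usable := freeVRAM - freeVRAM/20 // keep a 5% safety buffer
	fmt.Println("offloadable layers:", usable/bytesPerLayer)
}
```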
Jeffrey Morgan
65dcd0ce35
always cleanup blob download (#747) 2023-10-10 13:12:29 -04:00
Michael Yang
0040f543a2
Merge pull request #743 from jmorganca/mxyng/http-proxy
handle upstream proxies
2023-10-10 09:59:06 -07:00
Matt Williams
767f9bdbbb
Merge pull request #585 from jmorganca/matt/examplementors
add the example for ask the mentors
2023-10-09 13:58:14 -07:00
Costa Alexoglou
f7f5169c94
Update api.md (#741)
Avoid triple ticks in the visual editor and when copied to the clipboard.
2023-10-09 16:01:46 -04:00
Michael Yang
2cfffea02e handle client proxy 2023-10-09 12:33:47 -07:00
Michael Yang
f6e98334e4 handle upstream proxies 2023-10-09 11:42:36 -07:00
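A hedged sketch of how a Go HTTP client can honor upstream proxy settings, as the two proxy commits above describe: http.ProxyFromEnvironment picks up HTTP_PROXY, HTTPS_PROXY, and NO_PROXY. This shows the general mechanism only; the URL is a placeholder and the repository's actual client configuration may differ.

```go
package main

import (
	"fmt"
	"net/http"
)

func main() {
	client := &http.Client{
		Transport: &http.Transport{
			// Read proxy configuration from the environment variables.
			Proxy: http.ProxyFromEnvironment,
		},
	}
	resp, err := client.Get("https://example.com/") // placeholder upstream URL
	if err != nil {
		fmt.Println("request failed:", err)
		return
	}
	defer resp.Body.Close()
	fmt.Println("status:", resp.Status)
}
```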
Jeffrey Morgan
ab0668293c llm: fix build on amd64 2023-10-06 14:39:54 -07:00
Bruce MacDonald
af4cf55884
not found error before pulling model (#718) 2023-10-06 16:06:20 -04:00
Bruce MacDonald
d6786f2945
add feedback for reading model metadata (#722) 2023-10-06 16:05:32 -04:00
Michael Yang
38dc2f79bc
Merge pull request #626 from jmorganca/mxyng/concurrent-downloads
parallel chunked downloads
2023-10-06 13:01:29 -07:00
Michael Yang
cb961c87ca
Merge pull request #679 from jamesbraza/modelfile-docs
`Modelfile` syntax highlighting
2023-10-06 12:59:45 -07:00
Michael Yang
0560b28a8d names 2023-10-06 12:56:56 -07:00
Michael Yang
10199c5987 replace done channel with file check 2023-10-06 12:56:56 -07:00
Michael Yang
288814d3e4 fix ref counts 2023-10-06 12:56:43 -07:00
Michael Yang
04733438da check head request response 2023-10-06 12:56:43 -07:00
Michael Yang
711e891f0f fix resumable downloads
glob returns files in lexical order which is not appropriate when
rebuilding the parts list
2023-10-06 12:56:43 -07:00
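
The commit body above explains the bug: filepath.Glob returns matches in lexical order, so "part-10" sorts before "part-2", which breaks rebuilding a parts list when resuming a download. A small sketch of the fix, sorting part files numerically; the file-name prefix is hypothetical, not the repository's naming scheme.

```go
package main

import (
	"fmt"
	"path/filepath"
	"sort"
	"strconv"
	"strings"
)

func main() {
	// Glob returns part files in lexical order: 0, 1, 10, 11, 2, ...
	parts, _ := filepath.Glob("blob-partial-*")

	// Sort by the numeric part index instead of the raw file name.
	sort.Slice(parts, func(i, j int) bool {
		ni, _ := strconv.Atoi(strings.TrimPrefix(filepath.Base(parts[i]), "blob-partial-"))
		nj, _ := strconv.Atoi(strings.TrimPrefix(filepath.Base(parts[j]), "blob-partial-"))
		return ni < nj
	})
	fmt.Println(parts)
}
```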
Michael Yang
090d08422b handle unexpected eofs 2023-10-06 12:56:43 -07:00
Michael Yang
5b84404c64 handle concurrent requests for the same blobs 2023-10-06 12:56:43 -07:00
Michael Yang
8544edca21 parallel chunked downloads 2023-10-06 12:56:43 -07:00
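A minimal sketch of the "parallel chunked downloads" approach named above: split a blob into parts and fetch each part concurrently with an HTTP Range request. The URL, sizes, and structure are assumptions for illustration, not the repository's implementation.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"sync"
)

func downloadPart(url string, start, end int64, wg *sync.WaitGroup) {
	defer wg.Done()
	req, _ := http.NewRequest("GET", url, nil)
	req.Header.Set("Range", fmt.Sprintf("bytes=%d-%d", start, end))
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return
	}
	defer resp.Body.Close()
	// A real client would write to the part file at the correct offset.
	io.Copy(io.Discard, resp.Body)
}

func main() {
	url := "https://example.com/blob"  // placeholder blob URL
	size := int64(100 << 20)           // hypothetical total blob size (100 MiB)
	partSize := int64(10 << 20)        // hypothetical part size (10 MiB)

	var wg sync.WaitGroup
	for off := int64(0); off < size; off += partSize {
		end := off + partSize - 1
		if end >= size {
			end = size - 1
		}
		wg.Add(1)
		go downloadPart(url, off, end, &wg)
	}
	wg.Wait()
}
```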
Bruce MacDonald
5d22319a2c
rename server subprocess (#700)
- this makes it easier to see that the subprocess is associated with ollama
2023-10-06 10:15:42 -04:00
Bruce MacDonald
2130c0708b
output type parsed from modelfile (#678) 2023-10-05 14:58:04 -04:00
Patrick Devine
61ff1946e6
revise help text (#706) 2023-10-05 11:36:07 -07:00
Bruce MacDonald
d06bc0cb6e
enable q8, q5, 5_1, and f32 for linux gpu (#699) 2023-10-05 12:53:47 -04:00
Alexander F. Rødseth
d104b7e997
Fix go test ./... issue: fmt.Println arg list ends with redundant newline (#705) 2023-10-05 11:11:04 -04:00
Bruce MacDonald
9e2de1bd2c
increase streaming buffer size (#692) 2023-10-04 14:09:00 -04:00
Jeffrey Morgan
dc87e9c9ae update Dockerfile to pass GOFLAGS 2023-10-03 07:05:15 -07:00
Michael Yang
367cb68dc1
Merge pull request #686 from jmorganca/mxyng/starcoder
decode starcoder
2023-10-02 22:47:19 -07:00
Michael Yang
c02c0cd483 starcoder 2023-10-02 19:56:51 -07:00
Patrick Devine
1852755154
show a default message when license/parameters/system prompt/template aren't specified (#681) 2023-10-02 14:34:52 -07:00
James Braza
6f2ce74231 Got rid of all caps to show it can be lower case 2023-10-02 13:54:27 -07:00
James Braza
6edcc5c79f Using code highlighting syntax around Modelfile 2023-10-02 13:46:05 -07:00
Bruce MacDonald
b1f7123301
clean up num_gpu calculation code (#673) 2023-10-02 14:53:42 -04:00
Bruce MacDonald
1fbf3585d6
Relay default values to llama runner (#672)
* include seed in params for llama.cpp server and remove empty filter for temp

* relay default predict options to llama.cpp

- reorganize options to match predict request for readability

* omit empty stop

---------

Co-authored-by: hallh <hallh@users.noreply.github.com>
2023-10-02 14:53:16 -04:00
Patrick Devine
99d5161e8a
don't wordwrap when stdout is redirected or piped (#662) 2023-10-02 11:50:55 -07:00
Michael
ea8380be45
add community project: Chatbot Ollama
add community project: Chatbot Ollama by @ivanfioravanti
2023-10-02 09:04:31 -07:00