ollama

Author	SHA1	Message	Date
Jeffrey Morgan	06bcfbd629	cleanup docker section in readme	2023-10-15 02:33:25 -04:00
Jeffrey Morgan	7d7c2510f8	add `docker exec` command to readme	2023-10-15 02:31:15 -04:00
Jeffrey Morgan	f9b2f999ac	update readme with `docker` setup and link to `import.md`	2023-10-15 02:23:03 -04:00
Jeffrey Morgan	c416087339	`import.md`: formatting and spelling	2023-10-15 01:39:46 -04:00
Jeffrey Morgan	6002cebd2c	`import.md`: convert and quantize docs	2023-10-15 00:11:51 -04:00
Jeffrey Morgan	212bdc541c	`import.md`: model architectures spelling	2023-10-15 00:07:58 -04:00
Jeffrey Morgan	dca6686273	add steps for creating a Modelfile and more example commands to `import.md`	2023-10-15 00:05:50 -04:00
Jeffrey Morgan	598621afab	add push script for docker images	2023-10-14 14:24:39 -04:00
Matt Williams	6479f49c09	Merge pull request #773 from jmorganca/mattw/howtoquant add how to quantize doc	2023-10-14 08:29:39 -07:00
Matt Williams	b2974a7095	applied mikes comments Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-14 08:29:24 -07:00
Jeffrey Morgan	832b4db9d4	Use correct url for auto updates	2023-10-13 19:04:42 -04:00
Bruce MacDonald	c43873f33b	check update response (#785 )	2023-10-13 18:05:46 -04:00
Michael Yang	11d82d7b9b	update checkvram	2023-10-13 14:47:29 -07:00
Michael Yang	36fe2deebf	only check system memory on macos	2023-10-13 14:47:29 -07:00
Michael Yang	4a8931f634	check total (system + video) memory	2023-10-13 14:47:29 -07:00
Michael Yang	bd6e38fb1a	refactor memory check	2023-10-13 14:47:29 -07:00
Michael Yang	92189a5855	fix memory check	2023-10-13 14:47:29 -07:00
Michael Yang	d790bf9916	Merge pull request #783 from jmorganca/mxyng/fix-gpu-offloading fix: offloading on low end GPUs	2023-10-13 14:36:44 -07:00
Michael Yang	35afac099a	do not use gpu binary when num_gpu == 0	2023-10-13 14:32:12 -07:00
Michael Yang	811c3d1900	no gpu if vram < 2GB	2023-10-13 14:32:12 -07:00
Bruce MacDonald	3553d10769	check for newer updates (#784 ) Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2023-10-13 17:29:46 -04:00
Bruce MacDonald	6fe178134d	improve api error handling (#781 ) - remove new lines from llama.cpp error messages relayed to client - check api option types and return error on wrong type - change num layers from 95% VRAM to 92% VRAM	2023-10-13 16:57:10 -04:00
Jeffrey Morgan	d890890f66	use lower glibc versions in `Dockerfile.build`	2023-10-13 01:06:19 -04:00
Jeffrey Morgan	89ba19feca	use Go `1.21.3` in `Dockerfile`	2023-10-12 23:23:12 -04:00
Jeffrey Morgan	6f58c77671	update `Dockerfile.build` for linux binary builds	2023-10-12 22:14:20 -04:00
Matt Williams	3c975f898f	update doc to refer to docker image Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-12 15:57:50 -07:00
Matt Williams	9245c8a1df	add how to quantize doc Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-12 15:34:57 -07:00
Michael Yang	7a537cdca9	Merge pull request #770 from jmorganca/mxyng/fix-download fix download	2023-10-12 12:56:43 -07:00
Michael Yang	257ffeb997	fix download	2023-10-12 12:52:43 -07:00
Matt Williams	9b513bb6b1	Merge pull request #753 from jmorganca/mattw/examplereorg rename the examples to be more descriptive	2023-10-12 11:24:12 -07:00
Matt Williams	042100f797	final rename Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-12 11:23:41 -07:00
Bruce MacDonald	7804b8fab9	validate api options fields from map (#711 )	2023-10-12 11:18:11 -04:00
Bruce MacDonald	56497663c8	relay model runner error message to client (#720 ) * give direction to user when runner fails * also relay errors from timeout * increase timeout to 3 minutes	2023-10-12 11:16:37 -04:00
Matt Williams	e1afcb8af2	simple gen to simple Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-11 21:29:07 -07:00
Matt Williams	385eeea357	remove with Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-11 21:26:11 -07:00
Matt Williams	8a41b244e8	add golang gen Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-11 21:20:50 -07:00
Jeffrey Morgan	92578798bb	fix relative links in `README.md`	2023-10-11 19:24:06 -04:00
Michael Yang	788637918a	Merge pull request #760 from jmorganca/mxyng/more-downloads Mxyng/more downloads	2023-10-11 14:33:10 -07:00
Michael Yang	c413a55093	download: handle inner errors	2023-10-11 14:15:30 -07:00
Michael Yang	630bb75d2a	dynamically size download parts based on file size	2023-10-11 14:10:25 -07:00
Michael Yang	a2055a1e93	update download	2023-10-11 14:10:25 -07:00
Michael Yang	b599946b74	add format bytes	2023-10-11 14:08:23 -07:00
Michael Yang	aca2d65b82	Merge pull request #757 from jmorganca/mxyng/format-time cleanup format time	2023-10-11 11:12:29 -07:00
Michael Yang	b5e08e3373	cleanup format time	2023-10-11 11:09:27 -07:00
Bruce MacDonald	274d5a5fdf	optional parameter to not stream response (#639 ) * update streaming request accept header * add optional stream param to request bodies	2023-10-11 12:54:27 -04:00
Matt Williams	fc6b49be32	add ts alternate to python langchain simplegen Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-11 09:50:15 -07:00
Bruce MacDonald	77295f716e	prevent waiting on exited command (#752 ) * prevent waiting on exited command * close llama runner once	2023-10-11 12:32:13 -04:00
Matt Williams	615f7d1dea	cleanup readme. Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-11 06:13:29 -07:00
Matt Williams	cdf5e106ae	rename dirs Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-11 06:10:24 -07:00
Matt Williams	a85329f59a	rename the models to be more descriptive Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-10 17:40:02 -07:00

... 30 31 32 33 34 ...

2671 commits