ollama

Author	SHA1	Message	Date
Bruce MacDonald	565648f3f7	relay CUDA errors to the client (#825)	2023-10-18 15:36:56 -04:00
Michael Yang	3a2477174f	Merge pull request #822 from ggozad/fix-tags-api Fix /api/tags for no models.	2023-10-18 09:34:00 -07:00
Yiorgis Gozadinos	8c6c2cbc8c	When the .ollama folder is broken or there are no models return an empty list on /api/tags	2023-10-18 08:23:20 +02:00
Michael Yang	08b0e04f40	Merge pull request #813 from jmorganca/mxyng/llama refactor llm/llama.go	2023-10-17 14:05:58 -07:00
Michael Yang	b36b0b71f8	use cut prefix	2023-10-17 14:01:39 -07:00
Michael Yang	094df37563	remove unused struct	2023-10-17 14:01:38 -07:00
Bruce MacDonald	f3648fd206	Update llama.cpp gguf to latest (#710)	2023-10-17 16:55:16 -04:00
Bruce MacDonald	bd93a94abd	fix MB VRAM log output (#824)	2023-10-17 15:35:16 -04:00
Michael Yang	f55bdb6f10	Merge pull request #799 from deichbewohner/jsonmarshaling Fix JSON Marshal Escaping for Special Characters	2023-10-17 08:46:02 -07:00
Michael Yang	2870a9bfc8	Merge pull request #812 from jmorganca/mxyng/fix-format-string fix: wrong format string type	2023-10-17 08:40:49 -07:00
Michael Yang	c031c211d1	Merge pull request #809 from jmorganca/mxyng/fix-gpu fix: regression unsupported metal types	2023-10-17 08:40:40 -07:00
Andreas Wäscher	68391b0055	Add OllamaSharp for .NET (#811)	2023-10-17 11:31:48 -04:00
Alexander F. Rødseth	b7e137323a	Fix a typo (#818)	2023-10-17 09:00:15 -04:00
Arne Müller	8fa3f366ad	Removed newline trimming and used buffer directly in POST request.	2023-10-17 08:17:35 +02:00
Michael Yang	fddb303f23	fix: format string wrong type	2023-10-16 16:14:28 -07:00
Michael Yang	ad5ee20c7b	Merge pull request #794 from ggozad/add_oterm Add oterm to community integrations	2023-10-16 15:51:55 -07:00
Michael Yang	785b4eb5bf	Merge branch 'main' into add_oterm	2023-10-16 15:51:44 -07:00
Michael Yang	16ede1b30b	Merge pull request #801 from s-kostyaev/add-ellama-community-integration Add ellama community integration	2023-10-16 15:51:25 -07:00
Michael Yang	17d6bbbb2a	Merge pull request #810 from vieux/patch-1 Update install.sh	2023-10-16 15:50:57 -07:00
Victor Vieux	6481b7f34c	Update install.sh, avoid ARCH: unbound variable	2023-10-16 14:40:24 -07:00
Michael Yang	cb4a80b693	fix: regression unsupported metal types omitting `--n-gpu-layers` means use metal on macos which isn't correct since ollama uses `num_gpu=0` to explicitly disable gpu for file types that are not implemented in metal	2023-10-16 14:37:20 -07:00
Bruce MacDonald	68d7255bd3	show request to server rather than local check (#778)	2023-10-16 17:27:25 -04:00
Michael Yang	9ef2fce33a	Merge pull request #768 from jmorganca/mxyng/bytes fix memory check	2023-10-16 12:42:41 -07:00
Michael Yang	43eaba3d60	Merge pull request #787 from jmorganca/mxyng/server-version2 server: print version on start	2023-10-16 09:59:30 -07:00
Michael Yang	1af493c5a0	server: print version on start	2023-10-16 09:59:14 -07:00
Bruce MacDonald	a0c3e989de	deprecate modelfile embed command (#759)	2023-10-16 11:07:37 -04:00
Sergey Kostyaev	7af0fdce48	add ellama community integration	2023-10-16 16:39:10 +07:00
Arne Müller	ee94693b1a	handling unescaped json marshaling	2023-10-16 11:15:55 +02:00
Yiorgis Gozadinos	731dbdc1a5	Add oterm to community integrations	2023-10-15 23:21:17 +02:00
Jeffrey Morgan	06bcfbd629	cleanup docker section in readme	2023-10-15 02:33:25 -04:00
Jeffrey Morgan	7d7c2510f8	add `docker exec` command to readme	2023-10-15 02:31:15 -04:00
Jeffrey Morgan	f9b2f999ac	update readme with `docker` setup and link to `import.md`	2023-10-15 02:23:03 -04:00
Jeffrey Morgan	c416087339	`import.md`: formatting and spelling	2023-10-15 01:39:46 -04:00
Jeffrey Morgan	6002cebd2c	`import.md`: convert and quantize docs	2023-10-15 00:11:51 -04:00
Jeffrey Morgan	212bdc541c	`import.md`: model architectures spelling	2023-10-15 00:07:58 -04:00
Jeffrey Morgan	dca6686273	add steps for creating a Modelfile and more example commands to `import.md`	2023-10-15 00:05:50 -04:00
Jeffrey Morgan	598621afab	add push script for docker images	2023-10-14 14:24:39 -04:00
Matt Williams	6479f49c09	Merge pull request #773 from jmorganca/mattw/howtoquant add how to quantize doc	2023-10-14 08:29:39 -07:00
Matt Williams	b2974a7095	applied mikes comments Signed-off-by: Matt Williams <m@technovangelist.com>	2023-10-14 08:29:24 -07:00
Jeffrey Morgan	832b4db9d4	Use correct url for auto updates	2023-10-13 19:04:42 -04:00
Bruce MacDonald	c43873f33b	check update response (#785)	2023-10-13 18:05:46 -04:00
Michael Yang	11d82d7b9b	update checkvram	2023-10-13 14:47:29 -07:00
Michael Yang	36fe2deebf	only check system memory on macos	2023-10-13 14:47:29 -07:00
Michael Yang	4a8931f634	check total (system + video) memory	2023-10-13 14:47:29 -07:00
Michael Yang	bd6e38fb1a	refactor memory check	2023-10-13 14:47:29 -07:00
Michael Yang	92189a5855	fix memory check	2023-10-13 14:47:29 -07:00
Michael Yang	d790bf9916	Merge pull request #783 from jmorganca/mxyng/fix-gpu-offloading fix: offloading on low end GPUs	2023-10-13 14:36:44 -07:00
Michael Yang	35afac099a	do not use gpu binary when num_gpu == 0	2023-10-13 14:32:12 -07:00
Michael Yang	811c3d1900	no gpu if vram < 2GB	2023-10-13 14:32:12 -07:00
Bruce MacDonald	3553d10769	check for newer updates (#784) Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2023-10-13 17:29:46 -04:00

1150 commits