ollama

Author	SHA1	Message	Date
Daniel Hiltgen	f83e4db365	Switch runner for final release job The manifest and tagging step use a lot of disk space	2024-03-25 20:51:40 -07:00
Daniel Hiltgen	540f4af45f	Wire up more complete CI for releases Flesh out our github actions CI so we can build official releaes.	2024-03-15 12:37:36 -07:00
Blake Mizerany	8546dd3d72	.github: fix model and feature request yml (#3155 )	2024-03-14 15:26:06 -07:00
Blake Mizerany	87100be5e0	.github: add issue templates (#3143 )	2024-03-14 15:19:10 -07:00
Michael Yang	2cb74e23fb	fix ci	2024-03-07 11:33:49 -08:00
Daniel Hiltgen	3c8df3808b	Merge pull request #2885 from dhiltgen/rocm_v6_only Revamp ROCm support	2024-03-07 10:51:00 -08:00
Michael Yang	72431031d9	no ci test on docs, examples	2024-03-07 10:44:48 -08:00
Daniel Hiltgen	6c5ccb11f9	Revamp ROCm support This refines where we extract the LLM libraries to by adding a new OLLAMA_HOME env var, that defaults to `~/.ollama` The logic was already idempotenent, so this should speed up startups after the first time a new release is deployed. It also cleans up after itself. We now build only a single ROCm version (latest major) on both windows and linux. Given the large size of ROCms tensor files, we split the dependency out. It's bundled into the installer on windows, and a separate download on windows. The linux install script is now smart and detects the presence of AMD GPUs and looks to see if rocm v6 is already present, and if not, then downloads our dependency tar file. For Linux discovery, we now use sysfs and check each GPU against what ROCm supports so we can degrade to CPU gracefully instead of having llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows dynamic library loading logic to access the amdhip64.dll APIs to query the GPU information.	2024-03-07 10:36:50 -08:00
Jeffrey Morgan	d481fb3cc8	update go to 1.22 in other places (#2975 )	2024-03-07 07:39:49 -08:00
Michael Yang	46c847c4ad	enable rocm builds	2024-02-06 13:36:13 -08:00
Michael Yang	92b1a21f79	use linux runners	2024-02-06 13:36:04 -08:00
Michael Yang	f06b99a461	disable rocm builds	2024-02-06 09:29:42 -08:00
Michael Yang	a8c5413d06	only generate gpu libs	2024-01-25 15:41:56 -08:00
Michael Yang	5580de4571	archive ollama binaries	2024-01-25 15:40:16 -08:00
Michael Yang	946431d5b0	build cuda and rocm	2024-01-25 15:40:15 -08:00
Michael Yang	0610126049	remove env setting	2024-01-25 15:39:43 -08:00
Michael Yang	8e5d359a03	stub generate outputs for lint	2024-01-24 17:36:10 -08:00
Michael Yang	e299831e2c	Merge pull request #1958 from purificant/ci ci: update setup-go action	2024-01-18 14:53:36 -08:00
Daniel Hiltgen	ecbfc0182f	Go bump to v1.21 to pick up slog	2024-01-18 14:12:57 -08:00
Daniel Hiltgen	b992bf65fc	Disable arm64 for test phase The runners are x86 so we can only run binaries that match.	2024-01-17 19:26:13 -08:00
Daniel Hiltgen	1b249748ab	Add multiple CPU variants for Intel Mac This also refines the build process for the ext_server build.	2024-01-17 15:08:54 -08:00
Daniel Hiltgen	b3035112a1	Add macos cross-compile CI coverage	2024-01-14 10:38:59 -08:00
purificant	6a5bfc2ed6	update actions/setup-go	2024-01-12 22:27:25 +00:00
Michael Yang	997253143f	add lint and test on pull_request	2024-01-09 09:36:58 -08:00

24 commits