ollama

Author	SHA1	Message	Date
Michael Yang	4d4f75a8a8	Revert "fix golangci workflow missing gofmt and goimports (#4190 )" This reverts commit `04f971c84b`.	2024-05-07 10:35:44 -07:00
alwqx	04f971c84b	fix golangci workflow missing gofmt and goimports (#4190 )	2024-05-07 09:49:40 -07:00
Michael Yang	7fea1ecdf6	Merge pull request #3958 from ollama/mxyng/fix-workflow use merge base for diff-tree	2024-04-26 14:17:56 -07:00
Blake Mizerany	054894271d	.github/workflows/test.yaml: add in-flight cancellations on new push (#3956 ) Also, remove a superfluous 'go get'	2024-04-26 13:54:24 -07:00
Michael Yang	6fef042f0b	use merge base for diff-tree	2024-04-26 13:54:15 -07:00
Daniel Hiltgen	8feb97dc0d	Move cuda/rocm dependency gathering into generate script This will make it simpler for CI to accumulate artifacts from prior steps	2024-04-25 22:38:44 -07:00
Daniel Hiltgen	8589d752ac	Fix release CI download-artifact path was being used incorrectly. It is where to extract the zip not the files in the zip to extract. Default is workspace dir which is what we want, so omit it	2024-04-25 17:27:11 -07:00
Daniel Hiltgen	058f6cd2cc	Move nested payloads to installer and zip file on windows Now that the llm runner is an executable and not just a dll, more users are facing problems with security policy configurations on windows that prevent users writing to directories and then executing binaries from the same location. This change removes payloads from the main executable on windows and shifts them over to be packaged in the installer and discovered based on the executables location. This also adds a new zip file for people who want to "roll their own" installation model.	2024-04-23 16:14:47 -07:00
Daniel Hiltgen	939d6a8606	Make CI lint verbvose	2024-04-23 10:17:42 -07:00
jmorganca	c8afe7168c	use correct extension for feature and model request issue templates	2024-04-17 15:18:40 -04:00
jmorganca	28d3cd0148	simpler feature and model request forms	2024-04-17 15:17:08 -04:00
jmorganca	eb5554232a	simpler feature and model request forms	2024-04-17 15:14:49 -04:00
jmorganca	2bdc320216	add descriptions to issue templates	2024-04-17 15:08:36 -04:00
jmorganca	32561aed09	simplify github issue templates a bit	2024-04-17 15:07:03 -04:00
Michael Yang	2b4ca6cf36	fix ci	2024-04-10 11:35:12 -07:00
Blake Mizerany	1524f323a3	Revert "build.go: introduce a friendlier way to build Ollama (#3548 )" (#3564 )	2024-04-09 15:57:45 -07:00
Blake Mizerany	fccf3eecaa	build.go: introduce a friendlier way to build Ollama (#3548 ) This commit introduces a more friendly way to build Ollama dependencies and the binary without abusing `go generate` and removing the unnecessary extra steps it brings with it. This script also provides nicer feedback to the user about what is happening during the build process. At the end, it prints a helpful message to the user about what to do next (e.g. run the new local Ollama).	2024-04-09 14:18:47 -07:00
Michael Yang	cb8352d6b4	ci: use go-version-file	2024-04-09 09:50:12 -07:00
Jeffrey Morgan	b0e7d35db8	use an older version of the mac os sdk in release (#3484 )	2024-04-04 09:48:54 -07:00
Daniel Hiltgen	883ec4d1ef	CI missing archive	2024-04-04 07:23:27 -07:00
Daniel Hiltgen	08600d5bec	CI subprocess path fix	2024-04-03 19:12:53 -07:00
Daniel Hiltgen	e4a7e5b2ca	Fix CI release glitches The subprocess change moved the build directory arm64 builds weren't setting cross-compilation flags when building on x86	2024-04-03 16:41:40 -07:00
Jeffrey Morgan	cd135317d2	Fix macOS builds on older SDKs (#3467 )	2024-04-03 10:45:54 -07:00
Daniel Hiltgen	841adda157	Fix windows lint CI flakiness	2024-04-02 12:22:16 -07:00
Daniel Hiltgen	58d95cc9bd	Switch back to subprocessing for llama.cpp This should resolve a number of memory leak and stability defects by allowing us to isolate llama.cpp in a separate process and shutdown when idle, and gracefully restart if it has problems. This also serves as a first step to be able to run multiple copies to support multiple models concurrently.	2024-04-01 16:48:18 -07:00
Michael Yang	1ec0df1069	fix generate output	2024-04-01 13:47:34 -07:00
Jeffrey Morgan	06a1508bfe	Update 90_bug_report.yml	2024-03-29 10:11:17 -04:00
Daniel Hiltgen	97ae517fbf	Merge pull request #3398 from dhiltgen/release_latest CI automation for tagging latest images	2024-03-28 16:25:54 -07:00
Daniel Hiltgen	44b813e459	Merge pull request #3377 from dhiltgen/rocm_v6_bump Bump ROCm to 6.0.2 patch release	2024-03-28 16:07:54 -07:00
Daniel Hiltgen	539043f5e0	CI automation for tagging latest images	2024-03-28 16:07:37 -07:00
Daniel Hiltgen	c91a4ebcff	Bump ROCm to 6.0.2 patch release	2024-03-28 15:58:57 -07:00
Daniel Hiltgen	b79c7e4528	CI windows gpu builds If we're doing generate, test windows cuda and rocm as well	2024-03-28 14:39:10 -07:00
Michael Yang	5255d0af8a	fix: workflows	2024-03-27 16:30:01 -07:00
Michael Yang	8838ae787d	stub stub	2024-03-27 13:59:12 -07:00
Michael Yang	db75402ade	mangle arch	2024-03-27 13:44:50 -07:00
Michael Yang	1e85a140a3	only generate on changes to llm subdirectory	2024-03-27 12:45:35 -07:00
Michael Yang	5b0c48d29e	only generate cuda/rocm when changes to llm detected	2024-03-27 12:23:09 -07:00
Daniel Hiltgen	f83e4db365	Switch runner for final release job The manifest and tagging step use a lot of disk space	2024-03-25 20:51:40 -07:00
Daniel Hiltgen	540f4af45f	Wire up more complete CI for releases Flesh out our github actions CI so we can build official releaes.	2024-03-15 12:37:36 -07:00
Blake Mizerany	8546dd3d72	.github: fix model and feature request yml (#3155 )	2024-03-14 15:26:06 -07:00
Blake Mizerany	87100be5e0	.github: add issue templates (#3143 )	2024-03-14 15:19:10 -07:00
Michael Yang	2cb74e23fb	fix ci	2024-03-07 11:33:49 -08:00
Daniel Hiltgen	3c8df3808b	Merge pull request #2885 from dhiltgen/rocm_v6_only Revamp ROCm support	2024-03-07 10:51:00 -08:00
Michael Yang	72431031d9	no ci test on docs, examples	2024-03-07 10:44:48 -08:00
Daniel Hiltgen	6c5ccb11f9	Revamp ROCm support This refines where we extract the LLM libraries to by adding a new OLLAMA_HOME env var, that defaults to `~/.ollama` The logic was already idempotenent, so this should speed up startups after the first time a new release is deployed. It also cleans up after itself. We now build only a single ROCm version (latest major) on both windows and linux. Given the large size of ROCms tensor files, we split the dependency out. It's bundled into the installer on windows, and a separate download on windows. The linux install script is now smart and detects the presence of AMD GPUs and looks to see if rocm v6 is already present, and if not, then downloads our dependency tar file. For Linux discovery, we now use sysfs and check each GPU against what ROCm supports so we can degrade to CPU gracefully instead of having llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows dynamic library loading logic to access the amdhip64.dll APIs to query the GPU information.	2024-03-07 10:36:50 -08:00
Jeffrey Morgan	d481fb3cc8	update go to 1.22 in other places (#2975 )	2024-03-07 07:39:49 -08:00
Michael Yang	46c847c4ad	enable rocm builds	2024-02-06 13:36:13 -08:00
Michael Yang	92b1a21f79	use linux runners	2024-02-06 13:36:04 -08:00
Michael Yang	f06b99a461	disable rocm builds	2024-02-06 09:29:42 -08:00
Michael Yang	a8c5413d06	only generate gpu libs	2024-01-25 15:41:56 -08:00

1 2

61 commits