ollama

Author	SHA1	Message	Date
Daniel Hiltgen	a54d4a28dc	Merge pull request #3088 from dhiltgen/rocm_igpu_linux Fix iGPU detection for linux	2024-03-12 17:20:27 -07:00
Daniel Hiltgen	82b0c7c27e	Fix iGPU detection for linux This fixes a few bugs in the new sysfs discovery logic. iGPUs are now correctly identified by their <1G VRAM reported. the sysfs IDs are off by one compared to what HIP wants due to the CPU being reported in amdgpu, but HIP only cares about GPUs.	2024-03-12 16:57:19 -07:00
Patrick Devine	ba7cf7fb66	add more docs on for the modelfile message command (#3087)	2024-03-12 16:41:41 -07:00
Bruce MacDonald	2f804068bd	warn when json format is expected but not mentioned in prompt (#3081)	2024-03-12 19:07:11 -04:00
Daniel Hiltgen	85129d3a32	Adapt our build for imported server.cpp	2024-03-12 14:57:15 -07:00
Daniel Hiltgen	9ac6440da3	Import server.cpp as of b2356	2024-03-12 13:58:06 -07:00
Michael Yang	0085297928	refactor readseeker	2024-03-12 12:54:18 -07:00
Daniel Hiltgen	34d00f90b1	Merge pull request #3070 from dhiltgen/visible_devices Add docs explaining GPU selection env vars	2024-03-12 11:36:46 -07:00
Daniel Hiltgen	b53229a2ed	Add docs explaining GPU selection env vars	2024-03-12 11:33:06 -07:00
racerole	53c107e20e	chore: fix typo (#3073) Signed-off-by: racerole <jiangyifeng@outlook.com>	2024-03-12 14:09:22 -04:00
mofanke	51578d8573	fix gpu_info_cuda.c compile warning (#3077)	2024-03-12 14:08:40 -04:00
Jeffrey Morgan	b5fcd9d3aa	use `-trimpath` when building releases (#3069)	2024-03-11 15:58:46 -07:00
Bruce MacDonald	b80661e8c7	relay load model errors to the client (#3065)	2024-03-11 16:48:27 -04:00
Jeffrey Morgan	6d3adfbea2	Update troubleshooting.md	2024-03-11 13:22:28 -07:00
Jeffrey Morgan	369eda65f5	update llama.cpp submodule to `ceca1ae` (#3064)	2024-03-11 12:57:48 -07:00
Michael Yang	f878e91070	Merge pull request #3044 from ollama/mxyng/fix-convert-shape convert: fix shape	2024-03-11 09:56:57 -07:00
Daniel Hiltgen	0d651478e4	Merge pull request #3056 from dhiltgen/rocm_link_clash Avoid rocm runner and dependency clash	2024-03-11 09:48:48 -07:00
Michael Yang	9ea492f1ce	convert: fix shape	2024-03-11 09:41:01 -07:00
Daniel Hiltgen	bc13da2bfe	Avoid rocm runner and dependency clash Putting the rocm symlink next to the runners is risky. This moves the payloads into a subdir to avoid potential clashes.	2024-03-11 09:33:22 -07:00
Jeffrey Morgan	41b00b9856	fix `03-locale.diff`	2024-03-10 16:21:05 -07:00
Daniel Hiltgen	c2a8ed48e7	Merge pull request #3048 from dhiltgen/harden_rocm_deps Harden for deps file being empty (or short)	2024-03-10 15:17:22 -07:00
Daniel Hiltgen	3dc1bb6a35	Harden for deps file being empty (or short)	2024-03-10 14:45:38 -07:00
Daniel Hiltgen	7865a6996a	Merge pull request #3046 from dhiltgen/rocm_search_paths Add ollama executable peer dir for rocm	2024-03-10 12:30:56 -07:00
Daniel Hiltgen	00ec269321	Add ollama executable peer dir for rocm This allows people who package up ollama on their own to place the rocm dependencies in a peer directory to the ollama executable much like our windows install flow.	2024-03-10 12:16:30 -07:00
Jeffrey Morgan	908005d90b	patch: use default locale in wpm tokenizer (#3034)	2024-03-09 21:12:12 -08:00
Jeffrey Morgan	cdf65e793f	only copy deps for `amd64` in `build_linux.sh`	2024-03-09 17:55:22 -08:00
Daniel Hiltgen	82ca694d68	Rename ROCm deps file to avoid confusion (#3025)	2024-03-09 17:48:38 -08:00
Jeffrey Morgan	5017a15bcb	add `macapp` to `.dockerignore`	2024-03-09 16:07:06 -08:00
Jeffrey Morgan	e11668aa07	add `bundle_metal` and `cleanup_metal` funtions to `gen_darwin.sh`	2024-03-09 16:04:57 -08:00
Jeffrey Morgan	0bd0f4a29c	tidy cleanup logs	2024-03-09 15:56:48 -08:00
Jeffrey Morgan	1ffb1e2874	update llama.cpp submodule to `77d1ac7` (#3030)	2024-03-09 15:55:34 -08:00
Daniel Hiltgen	0a7844413c	Merge pull request #3026 from dhiltgen/win_rocm_docs Doc how to set up ROCm builds on windows	2024-03-09 14:17:19 -08:00
Jeffrey Morgan	f9cd55c70b	disable gpu for certain model architectures and fix divide-by-zero on memory estimation	2024-03-09 12:51:38 -08:00
Daniel Hiltgen	0fdebb34a9	Doc how to set up ROCm builds on windows	2024-03-09 11:29:45 -08:00
Daniel Hiltgen	ac64cd4ef9	Merge pull request #3008 from dhiltgen/no_more_idempotent Finish unwinding idempotent payload logic	2024-03-09 09:13:24 -08:00
Daniel Hiltgen	4a5c9b8035	Finish unwinding idempotent payload logic The recent ROCm change partially removed idempotent payloads, but the ggml-metal.metal file for mac was still idempotent. This finishes switching to always extract the payloads, and now that idempotentcy is gone, the version directory is no longer useful.	2024-03-09 08:34:39 -08:00
Jeffrey Morgan	efe5617b64	update llama.cpp submodule to `c2101a2` (#3020)	2024-03-09 00:44:50 -08:00
Jeffrey Morgan	5b3fad9636	separate out `isLocalIP`	2024-03-09 00:22:08 -08:00
Jeffrey Morgan	bfec2c6e10	simplify host checks	2024-03-08 23:29:53 -08:00
Jeffrey Morgan	5c143af726	add additional allowed hosts	2024-03-08 23:23:59 -08:00
Jeffrey Morgan	6c0af2599e	Update docs `README.md` and table of contents	2024-03-08 22:45:11 -08:00
Jeffrey Morgan	fc8c044584	add allowed host middleware and remove `workDir` middleware (#3018)	2024-03-08 22:23:47 -08:00
Michael Yang	ecc133d843	Merge pull request #3014 from ollama/mxyng/decode-ggla	2024-03-08 16:14:53 -08:00
Michael Yang	76bdebbadf	decode ggla	2024-03-08 15:46:25 -08:00
Michael Yang	18979ad4a1	convert: fix default shape	2024-03-08 15:42:48 -08:00
Michael Yang	8e0ef931d8	Merge pull request #2990 from ollama/mxyng/default-term-size fix: default terminal width, height	2024-03-08 15:20:54 -08:00
Daniel Hiltgen	280da44522	Merge pull request #2988 from dhiltgen/rocm_docs Refined ROCm troubleshooting docs	2024-03-08 13:33:30 -08:00
Bruce MacDonald	0cebc79cba	fix: allow importing a model from name reference (#3005)	2024-03-08 12:27:47 -05:00
Jeffrey Morgan	0e4669b04f	update llama.cpp submodule to `6cdabe6` (#2999)	2024-03-08 00:26:20 -08:00
Jeffrey Morgan	b886bec3f9	Update api.md	2024-03-07 23:27:51 -08:00

2725 commits