ollama

Author	SHA1	Message	Date
Daniel Hiltgen	0f5b843319	Refine Accelerate usage on mac For old macs, accelerate seems to cause crashes, but for AVX2 capable macs, it does not.	2024-01-22 16:25:56 -08:00
Jeffrey Morgan	ffaf52e1e9	update submodule to `011e8ec577fd135cbc02993d3ea9840c516d6a1c`	2024-01-22 15:16:54 -08:00
Michael Yang	940b10b036	Merge pull request #2144 from jmorganca/mxyng/update-faq faq: update to use launchctl setenv	2024-01-22 13:46:57 -08:00
Daniel Hiltgen	3bc28736cd	Merge pull request #2143 from dhiltgen/llm_verbosity Refine debug logging for llm	2024-01-22 13:19:16 -08:00
Michael Yang	93a756266c	faq: update to use launchctl setenv	2024-01-22 13:10:13 -08:00
Daniel Hiltgen	a0a829bf7a	Merge pull request #2142 from dhiltgen/debug_on_fail Debug logging on init failure	2024-01-22 12:29:22 -08:00
Daniel Hiltgen	730dcfcc7a	Refine debug logging for llm This wires up logging in llama.cpp to always go to stderr, and also turns up logging if OLLAMA_DEBUG is set.	2024-01-22 12:26:49 -08:00
Daniel Hiltgen	27a2d5af54	Debug logging on init failure	2024-01-22 12:08:22 -08:00
Jeffrey Morgan	5f81a33f43	update submodule to `6f9939d` (#2115 )	2024-01-22 11:56:40 -08:00
Michael Yang	6225fde046	Merge pull request #2102 from jmorganca/mxyng/fix-create-override fix: remove overwritten model layers	2024-01-22 09:37:48 -08:00
Meng Zhuo	069184562b	readline: drop not use min function (#2134 )	2024-01-22 08:15:08 -08:00
Daniel Hiltgen	5576bb2348	Merge pull request #2130 from dhiltgen/more_faster Make CPU builds parallel and customizable AMD GPUs	2024-01-21 16:14:12 -08:00
Daniel Hiltgen	2738837786	Merge pull request #2131 from dhiltgen/probe_cards_at_init Probe GPUs before backend init	2024-01-21 16:13:47 -08:00
Daniel Hiltgen	ec3764538d	Probe GPUs before backend init Detect potential error scenarios so we can fallback to CPU mode without hitting asserts.	2024-01-21 15:59:38 -08:00
Daniel Hiltgen	df54c723ae	Make CPU builds parallel and customizable AMD GPUs The linux build now support parallel CPU builds to speed things up. This also exposes AMD GPU targets as an optional setting for advaced users who want to alter our default set.	2024-01-21 15:12:21 -08:00
Daniel Hiltgen	fa8c990e58	Merge pull request #2127 from dhiltgen/rocm_container Combine the 2 Dockerfiles and add ROCm	2024-01-21 11:49:01 -08:00
Daniel Hiltgen	da72235ebf	Combine the 2 Dockerfiles and add ROCm This renames Dockerfile.build to Dockerfile, and adds some new stages to support 2 modes of building - the build_linux.sh script uses intermediate stages to extract the artifacts for ./dist, and the default build generates a container image usable by both cuda and rocm cards. This required transitioniing the x86 base to the rocm image to avoid layer bloat.	2024-01-21 11:37:11 -08:00
Jeffrey Morgan	89c4aee29e	Unlock mutex when failing to load model (#2117 )	2024-01-20 20:54:46 -05:00
Jeffrey Morgan	f32ea81b21	increase minimum overhead to 1024MiB (#2114 )	2024-01-20 17:11:38 -05:00
Jeffrey Morgan	4c54f0ddeb	sign dylibs on macOS (#2101 )	2024-01-19 19:24:11 -05:00
Michael Yang	c08dfaa23d	fix: remove overwritten model layers if create overrides a manifest, first add the older manifest's layers to the delete map so they can be cleaned up	2024-01-19 14:58:37 -08:00
Daniel Hiltgen	3b76e736ae	Merge pull request #2100 from dhiltgen/more_wsl_globs More WSL paths	2024-01-19 13:41:08 -08:00
Daniel Hiltgen	552db98bf1	More WSL paths	2024-01-19 13:23:29 -08:00
Daniel Hiltgen	fdcdfef620	Merge pull request #2099 from dhiltgen/fix_cuda_model_swap Switch to local dlopen symbols	2024-01-19 12:22:04 -08:00
Daniel Hiltgen	6a042438af	Switch to local dlopen symbols	2024-01-19 11:37:02 -08:00
Jeffrey Morgan	dc88cc3981	use `gzip` for runner embedding (#2067 )	2024-01-19 13:23:03 -05:00
Daniel Hiltgen	62976087c6	Merge pull request #1999 from lainedfles/termux_android_cpu_only Fix CPU-only build under Android Termux enviornment.	2024-01-18 17:16:53 -08:00
Self Denial	344342abdf	Restore dyn_ext_server.c since RTLD_DEEPBIND has been removed	2024-01-18 17:30:42 -07:00
Self Denial	eb76f3e379	Fix CPU-only build under Android Termux enviornment. Update gpu.go initGPUHandles() to declare gpuHandles variable before reading it. This resolves an "invalid memory address or nil pointer dereference" error. Update dyn_ext_server.c to avoid setting the RTLD_DEEPBIND flag under __TERMUX__ (Android).	2024-01-18 17:25:33 -07:00
Michael Yang	d017e3d0a6	Merge pull request #2060 from jmorganca/mxyng/fix-show fix show handler	2024-01-18 16:02:27 -08:00
Michael Yang	aac9ab4db7	fix show handler	2024-01-18 15:36:50 -08:00
Michael Yang	1f5b7ff976	Merge pull request #1932 from jmorganca/mxyng/api-fields api: add model for all requests	2024-01-18 14:56:51 -08:00
Michael Yang	e299831e2c	Merge pull request #1958 from purificant/ci ci: update setup-go action	2024-01-18 14:53:36 -08:00
Michael Yang	745b5934fa	add model to ModelResponse	2024-01-18 14:32:55 -08:00
Michael Yang	a38d88d828	api: add model for all requests prefer using req.Model and fallback to req.Name	2024-01-18 14:31:37 -08:00
Daniel Hiltgen	abec7f06e5	Merge pull request #2056 from dhiltgen/slog Mechanical switch from log to slog	2024-01-18 14:27:24 -08:00
Michael Yang	e5da190bac	Merge pull request #2020 from jmorganca/mxyng/install-fedora install: pin fedora to max 37	2024-01-18 14:23:42 -08:00
Daniel Hiltgen	ecbfc0182f	Go bump to v1.21 to pick up slog	2024-01-18 14:12:57 -08:00
Daniel Hiltgen	fedd705aea	Mechanical switch from log to slog A few obvious levels were adjusted, but generally everything mapped to "info" level.	2024-01-18 14:12:57 -08:00
Mike Bird	82ee019bfc	add open interpreter to list of extensions (#2016 )	2024-01-18 13:59:39 -08:00
Sachin Sachdeva	ad9dbc2a04	Haystack Ollama Integration (#2021 ) Updated readme with the web link for haystack ollama integration	2024-01-18 13:38:32 -08:00
Daniel Hiltgen	fccdf4c635	Merge pull request #1987 from xyproto/archlinux Let gpu.go and gen_linux.sh also find CUDA on Arch Linux	2024-01-18 13:32:10 -08:00
Daniel Hiltgen	d450fb1d1e	Merge pull request #2055 from dhiltgen/cuda_docs Refine the linux cuda/rocm developer docs	2024-01-18 12:07:31 -08:00
Daniel Hiltgen	df40b11d03	Merge pull request #2007 from dhiltgen/cpu_fallback Add multiple CPU variants for Intel Mac	2024-01-18 11:32:29 -08:00
Daniel Hiltgen	9cd20b0ec8	Refine the linux cuda/rocm developer docs	2024-01-18 09:44:44 -08:00
Daniel Hiltgen	b992bf65fc	Disable arm64 for test phase The runners are x86 so we can only run binaries that match.	2024-01-17 19:26:13 -08:00
Daniel Hiltgen	1b249748ab	Add multiple CPU variants for Intel Mac This also refines the build process for the ext_server build.	2024-01-17 15:08:54 -08:00
Alexander F. Rødseth	cbe2adc78a	Merge branch 'main' into archlinux	2024-01-17 12:50:11 +01:00
Michael Yang	d5a7353357	Merge pull request #2026 from jmorganca/mxyng/fix-windows fix: normalize name path before splitting	2024-01-16 16:58:42 -08:00
Michael Yang	96cfb62641	fix: normalize name path before splitting	2024-01-16 16:48:29 -08:00

1 2 3 4 5 ...

1859 commits