ollama

Author	SHA1	Message	Date
Self Denial	eb76f3e379	Fix CPU-only build under Android Termux environment. Update gpu.go initGPUHandles() to declare gpuHandles variable before reading it. This resolves an "invalid memory address or nil pointer dereference" error. Update dyn_ext_server.c to avoid setting the RTLD_DEEPBIND flag under __TERMUX__ (Android).	2024-01-18 17:25:33 -07:00
Michael Yang	d017e3d0a6	Merge pull request #2060 from jmorganca/mxyng/fix-show fix show handler	2024-01-18 16:02:27 -08:00
Michael Yang	aac9ab4db7	fix show handler	2024-01-18 15:36:50 -08:00
Michael Yang	1f5b7ff976	Merge pull request #1932 from jmorganca/mxyng/api-fields api: add model for all requests	2024-01-18 14:56:51 -08:00
Michael Yang	e299831e2c	Merge pull request #1958 from purificant/ci ci: update setup-go action	2024-01-18 14:53:36 -08:00
Michael Yang	745b5934fa	add model to ModelResponse	2024-01-18 14:32:55 -08:00
Michael Yang	a38d88d828	api: add model for all requests prefer using req.Model and fallback to req.Name	2024-01-18 14:31:37 -08:00
Daniel Hiltgen	abec7f06e5	Merge pull request #2056 from dhiltgen/slog Mechanical switch from log to slog	2024-01-18 14:27:24 -08:00
Michael Yang	e5da190bac	Merge pull request #2020 from jmorganca/mxyng/install-fedora install: pin fedora to max 37	2024-01-18 14:23:42 -08:00
Daniel Hiltgen	ecbfc0182f	Go bump to v1.21 to pick up slog	2024-01-18 14:12:57 -08:00
Daniel Hiltgen	fedd705aea	Mechanical switch from log to slog A few obvious levels were adjusted, but generally everything mapped to "info" level.	2024-01-18 14:12:57 -08:00
Mike Bird	82ee019bfc	add open interpreter to list of extensions (#2016)	2024-01-18 13:59:39 -08:00
Sachin Sachdeva	ad9dbc2a04	Haystack Ollama Integration (#2021) Updated readme with the web link for haystack ollama integration	2024-01-18 13:38:32 -08:00
Daniel Hiltgen	fccdf4c635	Merge pull request #1987 from xyproto/archlinux Let gpu.go and gen_linux.sh also find CUDA on Arch Linux	2024-01-18 13:32:10 -08:00
Daniel Hiltgen	d450fb1d1e	Merge pull request #2055 from dhiltgen/cuda_docs Refine the linux cuda/rocm developer docs	2024-01-18 12:07:31 -08:00
Daniel Hiltgen	df40b11d03	Merge pull request #2007 from dhiltgen/cpu_fallback Add multiple CPU variants for Intel Mac	2024-01-18 11:32:29 -08:00
Daniel Hiltgen	9cd20b0ec8	Refine the linux cuda/rocm developer docs	2024-01-18 09:44:44 -08:00
Daniel Hiltgen	b992bf65fc	Disable arm64 for test phase The runners are x86 so we can only run binaries that match.	2024-01-17 19:26:13 -08:00
Daniel Hiltgen	1b249748ab	Add multiple CPU variants for Intel Mac This also refines the build process for the ext_server build.	2024-01-17 15:08:54 -08:00
Alexander F. Rødseth	cbe2adc78a	Merge branch 'main' into archlinux	2024-01-17 12:50:11 +01:00
Michael Yang	d5a7353357	Merge pull request #2026 from jmorganca/mxyng/fix-windows fix: normalize name path before splitting	2024-01-16 16:58:42 -08:00
Michael Yang	96cfb62641	fix: normalize name path before splitting	2024-01-16 16:48:29 -08:00
Daniel Hiltgen	7d00b5d110	Merge pull request #1915 from dhiltgen/bump_llama_with_new_dep Bump llama.cpp to b1842 and add new cuda lib dep	2024-01-16 13:36:49 -08:00
Daniel Hiltgen	795674dd90	Bump llama.cpp to b1842 and add new cuda lib dep Upstream llama.cpp has added a new dependency with the NVIDIA CUDA Driver Libraries (libcuda.so) which is part of the driver distribution, not the general cuda libraries, and is not available as an archive, so we can not statically link it. This may introduce some additional compatibility challenges which we'll need to keep an eye on.	2024-01-16 12:53:52 -08:00
Daniel Hiltgen	e282bdccdd	Merge pull request #1990 from dhiltgen/ci_mac_cross Add macos cross-compile CI coverage	2024-01-16 12:31:37 -08:00
Michael Yang	d9bfb2f08f	install: pin fedora to max 37 repos for fedora 38 and newer do not exist as of this commit ``` $ dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/fedora38/x86_64/cuda-fedora38.repo Adding repo from: https://developer.download.nvidia.com/compute/cuda/repos/fedora38/x86_64/cuda-fedora38.repo Status code: 404 for https://developer.download.nvidia.com/compute/cuda/repos/fedora38/x86_64/cuda-fedora38.repo (IP: 152.195.19.142) Error: Configuration of repo failed ```	2024-01-16 11:45:21 -08:00
Michael Yang	598d6d5572	Merge pull request #1937 from jmorganca/mxyng/remove-client-py remove client.py	2024-01-16 11:01:41 -08:00
Bruce MacDonald	a897e833b8	do not cache prompt (#2018) - prompt cache causes inference to hang after some time	2024-01-16 13:48:05 -05:00
Patrick Devine	eef50accb4	Fix show parameters (#2017)	2024-01-16 10:34:44 -08:00
Michael Yang	05d53de7a1	Merge pull request #1968 from jmorganca/mxyng/fix-request-retry fix: request retry with error	2024-01-16 10:33:50 -08:00
Daniel Hiltgen	8795447dad	Merge pull request #1966 from fpreiss/fpreiss/gen_linux_cuda_detection improve cuda detection (rel. issue #1704)	2024-01-14 18:00:11 -08:00
Daniel Hiltgen	b3035112a1	Add macos cross-compile CI coverage	2024-01-14 10:38:59 -08:00
Daniel Hiltgen	95ad9a9fc8	Merge pull request #1988 from dhiltgen/fix_intel_mac Fix typo in arm mac arch script	2024-01-14 08:45:18 -08:00
Daniel Hiltgen	3ca5f69ce8	Fix typo in arm mac arch script	2024-01-14 08:32:57 -08:00
Daniel Hiltgen	cfa6337960	Merge pull request #1982 from dhiltgen/fix_intel_mac Fix intel mac build	2024-01-14 08:26:46 -08:00
Alexander F. Rødseth	f4bf1d514f	Let gpu.go and gen_linux.sh also find CUDA on Arch Linux	2024-01-14 13:40:36 +01:00
Jeffrey Morgan	557110d0ba	Disable `mmap` with lora layers (#1985)	2024-01-13 23:36:31 -05:00
Daniel Hiltgen	2ecb247276	Fix intel mac build Make sure we're building an x86 ext_server lib when cross-compiling	2024-01-13 14:46:34 -08:00
Jeffrey Morgan	288ef8ff95	add `gcc -lstdc++` flag for linux cpu (#1974)	2024-01-13 03:53:00 -05:00
Jeffrey Morgan	4cf17990f7	use g++ to build `libext_server.so` on linux (#1972)	2024-01-13 03:12:42 -05:00
Michael Yang	27331ae3a8	download: add inactivity monitor if a download part is inactive for some time, restart it	2024-01-12 15:23:15 -08:00
Michael Yang	b6c0ef1e70	Merge pull request #1961 from jmorganca/mxyng/rm-double-newline remove double newlines in /set parameter	2024-01-12 15:18:19 -08:00
Michael Yang	356d178f6e	Merge pull request #1971 from jmorganca/mxyng/max-context-length add max context length check	2024-01-12 15:10:25 -08:00
Michael Yang	eaed6f8c45	add max context length check	2024-01-12 14:54:07 -08:00
purificant	6a5bfc2ed6	update actions/setup-go	2024-01-12 22:27:25 +00:00
Michael Yang	cf29bd2d72	fix: request retry with error this fixes a subtle bug with makeRequestWithRetry where an HTTP status error on a retried request will potentially not return the right err	2024-01-12 13:32:27 -08:00
Fabian Preiss	905862e17b	improve cuda detection (rel. issue #1704)	2024-01-12 21:59:19 +01:00
Patrick Devine	565f8a3c44	Convert the REPL to use /api/chat for interactive responses (#1936)	2024-01-12 12:05:52 -08:00
Michael Yang	5121b7ac9c	remove double newlines in /set parameter	2024-01-12 11:21:15 -08:00
Michael Yang	a70262c6b2	Update README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-01-12 09:43:04 -08:00

2182 commits