ollama

Author	SHA1	Message	Date
Daniel Hiltgen	325d74985b	Fix CPU performance on hyperthreaded systems The default thread count logic was broken and resulted in 2x the number of threads as it should on a hyperthreading CPU resulting in thrashing and poor performance.	2023-12-21 16:23:36 -08:00
Bruce MacDonald	fabf2f3467	allow for starting llava queries with filepath (#1549 )	2023-12-21 13:20:59 -05:00
Daniel Hiltgen	d9cd3d9667	Revive windows build The windows native setup still needs some more work, but this gets it building again and if you set the PATH properly, you can run the resulting exe on a cuda system.	2023-12-20 17:21:54 -08:00
Patrick Devine	a607d922f0	add FAQ for slow networking in WSL2 (#1646 )	2023-12-20 16:27:24 -08:00
Daniel Hiltgen	7555ea44f8	Revamp the dynamic library shim This switches the default llama.cpp to be CPU based, and builds the GPU variants as dynamically loaded libraries which we can select at runtime. This also bumps the ROCm library to version 6 given 5.7 builds don't work on the latest ROCm library that just shipped.	2023-12-20 14:45:57 -08:00
Jeffrey Morgan	df06812494	Update api.md	2023-12-20 08:47:53 -05:00
Daniel Hiltgen	1d1eb1688c	Additional nvidial-ml path to check	2023-12-19 15:52:34 -08:00
Michael Yang	23dc179350	Merge pull request #1619 from jmorganca/mxyng/fix-version-test fix(test): use real version string for comparison	2023-12-19 15:48:52 -08:00
Michael Yang	63aac0edc5	fix(test): use real version string for comparison	2023-12-19 15:03:02 -08:00
Daniel Hiltgen	6558f94ed0	Fix darwin intel build	2023-12-19 13:32:24 -08:00
Erick Ghaumez	1ca484f67e	Add Langchain Dart library (#1564 ) * Add Langchain Dart * Update README.md --------- Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>	2023-12-19 14:04:52 -05:00
Jeffrey Morgan	72b0c32fe9	Update README.md	2023-12-19 12:59:22 -05:00
Jeffrey Morgan	68c28224f8	Update README.md	2023-12-19 12:59:03 -05:00
Daniel Hiltgen	54dbfa4c4a	Carry ggml-metal.metal as payload	2023-12-19 09:05:46 -08:00
Daniel Hiltgen	5646826a79	Add WSL2 path to nvidia-ml.so library	2023-12-19 09:05:46 -08:00
Daniel Hiltgen	3269535a4c	Refine handling of shim presence This allows the CPU only builds to work on systems with Radeon cards	2023-12-19 09:05:46 -08:00
Daniel Hiltgen	1b991d0ba9	Refine build to support CPU only If someone checks out the ollama repo and doesn't install the CUDA library, this will ensure they can build a CPU only version	2023-12-19 09:05:46 -08:00
Daniel Hiltgen	51082535e1	Add automated test for multimodal A simple test case that verifies llava:7b can read text in an image	2023-12-19 09:05:46 -08:00
Daniel Hiltgen	9adca7f711	Bump llama.cpp to b1662 and set n_parallel=1	2023-12-19 09:05:46 -08:00
Daniel Hiltgen	89bbaafa64	Build linux using ubuntu 20.04 This changes the container-based linux build to use an older Ubuntu distro to improve our compatibility matrix for older user machines	2023-12-19 09:05:46 -08:00
Daniel Hiltgen	35934b2e05	Adapted rocm support to cgo based llama.cpp	2023-12-19 09:05:46 -08:00
65a	f8ef4439e9	Use build tags to generate accelerated binaries for CUDA and ROCm on Linux. The build tags rocm or cuda must be specified to both go generate and go build. ROCm builds should have both ROCM_PATH set (and the ROCM SDK present) as well as CLBlast installed (for GGML) and CLBlast_DIR set in the environment to the CLBlast cmake directory (likely /usr/lib/cmake/CLBlast). Build tags are also used to switch VRAM detection between cuda and rocm implementations, using added "accelerator_foo.go" files which contain architecture specific functions and variables. accelerator_none is used when no tags are set, and a helper function addRunner will ignore it if it is the chosen accelerator. Fix go generate commands, thanks @deadmeu for testing.	2023-12-19 09:05:46 -08:00
Daniel Hiltgen	d4cd695759	Add cgo implementation for llama.cpp Run the server.cpp directly inside the Go runtime via cgo while retaining the LLM Go abstractions.	2023-12-19 09:05:46 -08:00
Bruce MacDonald	5e7fd6906f	Update images.go	2023-12-19 09:05:46 -08:00
Bruce MacDonald	811b1f03c8	deprecate ggml - remove ggml runner - automatically pull gguf models when ggml detected - tell users to update to gguf in the case automatic pull fails Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>	2023-12-19 09:05:46 -08:00
Matt Williams	ed195f3562	Merge pull request #1595 from pgibler/main Added cmdh to community section in README	2023-12-18 20:55:18 -08:00
Matt Williams	e0d0072ef1	Merge pull request #1592 from jmorganca/mattw/examplepruning Lets get rid of these old modelfile examples	2023-12-18 20:29:48 -08:00
pgibler	620a2ffcfb	Added cmdh to community section in README	2023-12-18 22:04:40 -05:00
Matt Williams	d287013f24	Lets get rid of these old modelfile examples Signed-off-by: Matt Williams <m@technovangelist.com>	2023-12-18 17:47:33 -08:00
Jeffrey Morgan	6b5bdfa6c9	update runner submodule	2023-12-18 17:33:46 -05:00
Jeffrey Morgan	c063ee4af0	update runner submodule to fix hipblas build	2023-12-18 15:41:13 -05:00
Bruce MacDonald	d99fa6ce0a	send empty messages on last chat response (#1530 )	2023-12-18 14:23:38 -05:00
Patrick Devine	3948c6ea06	add magic header for unit tests (#1558 )	2023-12-18 10:41:02 -08:00
Jeffrey Morgan	b85982eb91	update runner submodule	2023-12-18 12:43:31 -05:00
Patrick Devine	86b0dd4b16	add API create/copy handlers (#1541 )	2023-12-15 11:59:18 -08:00
Augustinas Malinauskas	f728738427	README with Enchanted iOS App (#1529 ) * feat(docs): README with Enchanted iOS app * Update README.md --------- Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>	2023-12-15 14:37:29 -05:00
Ian Purton	115048a0d8	Added Bionic GPT as a front end. (#1463 ) * Added Bionic GPT as a front end. * Update README.md --------- Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>	2023-12-15 14:33:04 -05:00
Bruce MacDonald	1b417a7836	use exp slices for go 1.20 compatibility (#1544 )	2023-12-15 14:15:56 -05:00
Patrick Devine	0174665d0e	add API tests for list handler (#1535 )	2023-12-14 18:18:25 -08:00
Patrick Devine	630518f0d9	Add unit test of API routes (#1528 )	2023-12-14 16:47:40 -08:00
Bruce MacDonald	6e16098a60	remove sample_count from docs (#1527 ) this info has not been returned from these endpoints in some time	2023-12-14 17:49:00 -05:00
Bruce MacDonald	6ee8c80199	restore model load duration on generate response (#1524 ) * restore model load duration on generate response - set model load duration on generate and chat done response - calculate createAt time when response created * remove checkpoints predict opts * Update routes.go	2023-12-14 12:15:50 -05:00
Jeffrey Morgan	31f0551dab	Update runner to support mixtral and mixture of experts (MoE) (#1475 )	2023-12-13 17:15:10 -05:00
Jeffrey Morgan	4a1abfe4fa	fix tests	2023-12-13 14:42:30 -05:00
Jeffrey Morgan	bbd41494bf	add multimodal to `README.md`	2023-12-13 14:38:47 -05:00
Jeffrey Morgan	fedba24a63	Docs for multimodal support (#1485 ) * add multimodal docs * add chat api docs * consistency between `/api/generate` and `/api/chat` * simplify docs	2023-12-13 13:59:33 -05:00
pepperoni21	e3b090dbc5	Added message format for chat api (#1488 )	2023-12-13 11:21:23 -05:00
Patrick Devine	d9e60f634b	add image support to the chat api (#1490 )	2023-12-12 13:28:58 -08:00
Michael Yang	4251b342de	Merge pull request #1469 from jmorganca/mxyng/model-types remove per-model types	2023-12-12 12:27:03 -08:00
Jeffrey Morgan	0a9d348023	Fix issues with `/set template` and `/set system` (#1486 )	2023-12-12 14:43:19 -05:00

... 9 10 11 12 13 ...

2144 commits