ollama

Author	SHA1	Message	Date
Daniel Hiltgen	d5ec730354	Merge pull request #1779 from dhiltgen/refined_amd_gpu_list Improve maintainability of Radeon card list	2024-01-03 16:18:57 -08:00
Daniel Hiltgen	8bed487aba	Merge pull request #1778 from dhiltgen/wsl1 Fail fast on WSL1 while allowing on WSL2	2024-01-03 16:18:41 -08:00
Daniel Hiltgen	c1a10a6e9b	Merge pull request #1781 from dhiltgen/cpu_only_build Fix CPU only builds	2024-01-03 16:18:25 -08:00
Daniel Hiltgen	ddbfa6fe31	Fix CPU only builds Go embed doesn't like when there's no matching files, so put a dummy placeholder in to allow building without any GPU support If no "server" library is found, it's safely ignored at runtime.	2024-01-03 16:08:34 -08:00
Daniel Hiltgen	2fcd41ef81	Fail fast on WSL1 while allowing on WSL2 This prevents users from accidentally installing on WSL1 with instructions guiding how to upgrade their WSL instance to version 2. Once running WSL2 if you have an NVIDIA card, you can follow their instructions to set up GPU passthrough and run models on the GPU. This is not possible on WSL1.	2024-01-03 16:02:32 -08:00
Daniel Hiltgen	16f4603b67	Improve maintainability of Radeon card list This moves the list of AMD GPUs to an easier to maintain list which should make it easier to update over time.	2024-01-03 15:16:56 -08:00
Daniel Hiltgen	1184686649	Merge pull request #1776 from dhiltgen/render_group Add ollama user to render group for Radeon support	2024-01-03 13:07:54 -08:00
Daniel Hiltgen	2588cb2daa	Add ollama user to render group for Radeon support For the ROCm libraries to access the driver, we need to add the ollama user to the render group.	2024-01-03 12:56:31 -08:00
Jeffrey Morgan	c7ea8f237e	set `num_gpu` to 1 only by default on darwin arm64 (#1771 )	2024-01-03 14:10:29 -05:00
Bruce MacDonald	0b3118e0af	fix: relay request opts to loaded llm prediction (#1761 )	2024-01-03 12:01:42 -05:00
Daniel Hiltgen	05face44ef	Merge pull request #1683 from dhiltgen/fix_windows_test Fix windows system memory lookup	2024-01-03 09:00:39 -08:00
Daniel Hiltgen	a2ad952440	Fix windows system memory lookup This refines the gpu package error handling and fixes a bug with the system memory lookup on windows.	2024-01-03 08:50:01 -08:00
Daniel Hiltgen	5fea4410be	Merge pull request #1680 from dhiltgen/better_patching Refactor how we augment llama.cpp and refine windows native build	2024-01-03 08:10:17 -08:00
Bruce MacDonald	b846eb64d0	Fix `template` api doc description (#1661 )	2024-01-03 11:00:59 -05:00
Cole Gillespie	3c5dd9ed1d	Update README.md (#1766 )	2024-01-03 10:44:22 -05:00
Jeffrey Morgan	b17ccd0542	Update import.md	2024-01-02 22:28:18 -05:00
Patrick Devine	d0409f772f	keyboard shortcut help (#1764 )	2024-01-02 18:04:12 -08:00
Jeffrey Morgan	ec261422af	use `docker build` in build scripts	2024-01-02 19:32:54 -05:00
Daniel Hiltgen	0498f7ce56	Get rid of one-line llama.log This one log line was triggering a single line llama.log to be generated in the pwd of the server	2024-01-02 15:36:16 -08:00
Daniel Hiltgen	738a8d12eb	Rename the ollama cmakefile	2024-01-02 15:36:16 -08:00
Daniel Hiltgen	d966b730ac	Switch windows build to fully dynamic Refactor where we store build outputs, and support a fully dynamic loading model on windows so the base executable has no special dependencies thus doesn't require a special PATH.	2024-01-02 15:36:16 -08:00
Daniel Hiltgen	9a70aecccb	Refactor how we augment llama.cpp This changes the model for llama.cpp inclusion so we're not applying a patch, but instead have the C++ code directly in the ollama tree, which should make it easier to refine and update over time.	2024-01-02 15:35:55 -08:00
Karim ElGhandour	22cd5eaab6	Added Ollama-SwiftUI to integrations (#1747 )	2024-01-02 09:47:50 -05:00
Dane Madsen	304a8799ca	Update README.md (#1757 )	2024-01-02 09:47:08 -05:00
Jeffrey Morgan	2a2fa3c329	`api.md` cleanup & formatting	2023-12-27 14:32:35 -05:00
Jeffrey Morgan	55978c1dc9	clean up cache api option	2023-12-27 14:27:45 -05:00
Jeffrey Morgan	d4ebdadbe7	enable `cache_prompt` by default	2023-12-27 14:23:42 -05:00
Daniel Hiltgen	e201efa14b	Add windows native build instructions	2023-12-25 08:31:34 -08:00
Icelain	c5f21f73a4	follow best practices by adding resp.Body.Close() (#1708 )	2023-12-25 09:01:37 -05:00
Jeffrey Morgan	371bc73531	Update README.md	2023-12-24 11:54:08 -05:00
Jeffrey Morgan	c651d8b824	Update README.md	2023-12-23 11:18:12 -05:00
Daniel Hiltgen	cf50ef5b51	Merge pull request #1684 from dhiltgen/tag_integration_tests Guard integration tests with a tag	2023-12-22 16:43:41 -08:00
Daniel Hiltgen	697bea6939	Guard integration tests with a tag This should help CI avoid running the integration test logic in a container where it's not currently possible.	2023-12-22 16:33:27 -08:00
K0IN	10da41d677	Add Cache flag to api (#1642 )	2023-12-22 17:16:20 -05:00
Bruce MacDonald	db356c8519	post-response templating (#1427 )	2023-12-22 17:07:05 -05:00
Jeffrey Morgan	b80081022f	cache docker builds in `build_linux.sh`	2023-12-22 16:01:20 -05:00
Matt Williams	790457398a	Merge pull request #1677 from jmorganca/mattw/docrunupdate update where are models stored q	2023-12-22 09:56:27 -08:00
Matt Williams	511069a2a5	update where are models stored q Signed-off-by: Matt Williams <m@technovangelist.com>	2023-12-22 09:48:44 -08:00
Matt Williams	5a85070c22	Update readmes, requirements, packagejsons, etc for all examples (#1452 ) Most of the examples needed updates of Readmes to show how to run them. Some of the requirements.txt files had extra content that wasn't needed, or missing altogether. Apparently some folks like to run npm start to run typescript, so a script was added to all typescript examples which hadn't been done before. Basically just a lot of cleanup. Signed-off-by: Matt Williams <m@technovangelist.com>	2023-12-22 09:10:41 -08:00
Matt Williams	291700c92d	Clean up documentation (#1506 ) * Clean up documentation Will probably need to update with PRs for new release. Signed-off-by: Matt Williams <m@technovangelist.com> * Correcting to fit in 0.1.15 changes Signed-off-by: Matt Williams <m@technovangelist.com> * Update README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * addressing comments Signed-off-by: Matt Williams <m@technovangelist.com> * more api cleanup Signed-off-by: Matt Williams <m@technovangelist.com> * its llava not llama Signed-off-by: Matt Williams <m@technovangelist.com> * Update docs/troubleshooting.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Updated hosting to server and documented all env vars Signed-off-by: Matt Williams <m@technovangelist.com> * remove last of the cli descriptions Signed-off-by: Matt Williams <m@technovangelist.com> * Update README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * update further per conversation with jeff earlier today Signed-off-by: Matt Williams <m@technovangelist.com> * cleanup the doc readme Signed-off-by: Matt Williams <m@technovangelist.com> * move upgrade to faq Signed-off-by: Matt Williams <m@technovangelist.com> * first change Signed-off-by: Matt Williams <m@technovangelist.com> * updated Signed-off-by: Matt Williams <m@technovangelist.com> * Update docs/faq.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * examples in parent Signed-off-by: Matt Williams <m@technovangelist.com> * add exapmle for create model. Signed-off-by: Matt Williams <m@technovangelist.com> * update faq Signed-off-by: Matt Williams <m@technovangelist.com> * update create model api Signed-off-by: Matt Williams <m@technovangelist.com> * Update docs/api.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/faq.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/troubleshooting.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * update the readme in docs Signed-off-by: Matt Williams <m@technovangelist.com> * update a few more things Signed-off-by: Matt Williams <m@technovangelist.com> * Update docs/troubleshooting.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/faq.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update README.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/modelfile.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Update docs/troubleshooting.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> --------- Signed-off-by: Matt Williams <m@technovangelist.com> Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2023-12-22 09:10:01 -08:00
Daniel Hiltgen	9db28af84e	Merge pull request #1675 from dhiltgen/less_verbose Quiet down llama.cpp logging by default	2023-12-22 08:57:17 -08:00
Daniel Hiltgen	e5202eb687	Quiet down llama.cpp logging by default By default builds will now produce non-debug and non-verbose binaries. To enable verbose logs in llama.cpp and debug symbols in the native code, set `CGO_CFLAGS=-g`	2023-12-22 08:47:18 -08:00
Daniel Hiltgen	96fb441abd	Merge pull request #1146 from dhiltgen/ext_server_cgo Add cgo implementation for llama.cpp	2023-12-22 08:16:31 -08:00
Daniel Hiltgen	495c06e4a6	Fix doc glitch	2023-12-21 18:21:31 -08:00
Daniel Hiltgen	fa24e73b82	Remove CPU build, fixup linux build script	2023-12-21 18:21:31 -08:00
Daniel Hiltgen	325d74985b	Fix CPU performance on hyperthreaded systems The default thread count logic was broken and resulted in 2x the number of threads as it should on a hyperthreading CPU resulting in thrashing and poor performance.	2023-12-21 16:23:36 -08:00
Bruce MacDonald	fabf2f3467	allow for starting llava queries with filepath (#1549 )	2023-12-21 13:20:59 -05:00
Daniel Hiltgen	d9cd3d9667	Revive windows build The windows native setup still needs some more work, but this gets it building again and if you set the PATH properly, you can run the resulting exe on a cuda system.	2023-12-20 17:21:54 -08:00
Patrick Devine	a607d922f0	add FAQ for slow networking in WSL2 (#1646 )	2023-12-20 16:27:24 -08:00
Daniel Hiltgen	7555ea44f8	Revamp the dynamic library shim This switches the default llama.cpp to be CPU based, and builds the GPU variants as dynamically loaded libraries which we can select at runtime. This also bumps the ROCm library to version 6 given 5.7 builds don't work on the latest ROCm library that just shipped.	2023-12-20 14:45:57 -08:00

... 6 7 8 9 10 ...

2039 commits