ollama

Author	SHA1	Message	Date
Michael Yang	bf7edb0d5d	lint linux	2024-06-04 11:13:30 -07:00
Michael Yang	f38353d6b9	stdin.fd	2024-06-04 11:13:30 -07:00
Michael Yang	201d853fdf	nolintlint	2024-06-04 11:13:30 -07:00
Michael Yang	e40145a39d	lint	2024-06-04 11:13:30 -07:00
Michael Yang	c895a7d13f	some gocritic	2024-06-04 11:13:30 -07:00
Michael Yang	dad7a987ae	nosprintfhostport	2024-06-04 11:13:30 -07:00
Michael Yang	8ffb51749f	nolintlint	2024-06-04 11:13:30 -07:00
Michael Yang	55f6eba049	gofmt	2024-06-04 11:13:30 -07:00
Michael Yang	04f3c12bb7	replace x/exp/slices with slices	2024-06-04 11:13:30 -07:00
Shubham	60323e0805	add embed model command and fix question invoke (#4766) * add embed model command and fix question invoke * Update docs/tutorials/langchainpy.md Co-authored-by: Kim Hallberg <hallberg.kim@gmail.com> * Update docs/tutorials/langchainpy.md --------- Co-authored-by: Kim Hallberg <hallberg.kim@gmail.com> Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-06-03 22:20:48 -07:00
Jeffrey Morgan	d4a86102fd	update welcome prompt in windows to `llama3` (#4779)	2024-06-01 21:05:51 -07:00
Jeffrey Morgan	476fb8e892	Limit GPU lib search for now (#4777) * fix oneapi errors on windows 10	2024-06-01 19:24:33 -07:00
Michael Yang	829ff87bd1	revert tokenize ffi (#4761) * Revert "use `int32_t` for call to tokenize (#4738)" This reverts commit `763bb65dbb`. * Revert "vocab only" This reverts commit `bf54c845e9`. * Revert "use ffi for tokenizing/detokenizing" This reverts commit `26a00a0410`.	2024-05-31 18:54:21 -07:00
Josh	f6b622c4b3	Merge pull request #4733 from ollama/jyan/isvalidname added IsValidNamespace function	2024-05-31 14:08:45 -07:00
Josh Yan	2e4da8eec2	added tests for IsValidNamespace	2024-05-31 11:48:07 -07:00
Jeffrey Morgan	763bb65dbb	use `int32_t` for call to tokenize (#4738) * use `int32_t` for call to tokenize * variable naming * cleanup * fix crash	2024-05-30 21:43:30 -07:00
Jeffrey Morgan	7ca9605f54	speed up tests by only building static lib (#4740)	2024-05-30 21:43:15 -07:00
Michael Yang	eb2c443a79	Merge pull request #4736 from ollama/mxyng/vocab-only vocab only for tokenize	2024-05-30 17:21:00 -07:00
Michael Yang	278e25ea44	Merge pull request #4737 from ollama/mxyng/less-generate only generate on relevant changes	2024-05-30 17:17:50 -07:00
Jeffrey Morgan	a50a87a7b8	partial offloading: allow flash attention and disable mmap (#4734) * partial offloading: allow flash attention and disable mmap * allow mmap with num_gpu=0	2024-05-30 16:58:01 -07:00
Michael Yang	98085015d5	only generate on relevant changes	2024-05-30 16:54:11 -07:00
Michael Yang	bf54c845e9	vocab only	2024-05-30 16:49:28 -07:00
Josh Yan	c365f195a8	directly use isvalidpart	2024-05-30 16:40:04 -07:00
Josh	e91d0ef737	Merge pull request #4728 from ollama/jyan/japanese fixed japanese characters deleted at end of line	2024-05-30 16:25:12 -07:00
Jeffrey Morgan	22f5c12ced	Update llama.cpp submodule to `5921b8f0` (#4731) * update llama.cpp submodule to `5921b8f089d3b7bda86aac5a66825df6a6c10603` * add patch	2024-05-30 16:20:22 -07:00
Josh Yan	298c996e54	added IsValidNamespace function	2024-05-30 16:02:07 -07:00
Daniel Hiltgen	0fc0cfc6d2	Merge pull request #4594 from dhiltgen/doc_container_workarounds Add isolated gpu test to troubleshooting	2024-05-30 13:10:54 -07:00
Josh Yan	914f68f021	replaced duplicate call with variable	2024-05-30 10:38:07 -07:00
Josh Yan	bd1d119ba9	fixed japanese characters deleted at end of line	2024-05-30 10:24:21 -07:00
Lei Jitang	a03be18189	Fix OLLAMA_LLM_LIBRARY with wrong map name and add more env vars to help message (#4663) * envconfig/config.go: Fix wrong description of OLLAMA_LLM_LIBRARY Signed-off-by: Lei Jitang <leijitang@outlook.com> * serve: Add more env to help message of ollama serve Add more environment variables to `ollama serve --help` to let users know what can be configured. Signed-off-by: Lei Jitang <leijitang@outlook.com> --------- Signed-off-by: Lei Jitang <leijitang@outlook.com>	2024-05-30 09:36:51 -07:00
Michael Yang	96bc232b43	Merge pull request #4413 from ollama/mxyng/name-check check if name exists before create/pull/copy	2024-05-29 12:06:58 -07:00
Michael Yang	bca7b12284	Merge pull request #3718 from ollama/mxyng/modelname-3 update delete handler to use model.Name	2024-05-29 12:02:07 -07:00
Michael Yang	32cb1960c1	Merge pull request #4380 from ollama/mxyng/tokenize use tokenize/detokenize	2024-05-29 12:00:59 -07:00
Michael Yang	de781b37c8	rm unused infill	2024-05-29 11:26:47 -07:00
Michael Yang	3e21799377	rm unused system prompt	2024-05-29 11:26:47 -07:00
Michael Yang	26a00a0410	use ffi for tokenizing/detokenizing	2024-05-29 11:26:47 -07:00
Daniel Hiltgen	646371f56d	Merge pull request #3278 from zhewang1-intc/rebase_ollama_main Enabling ollama to run on Intel GPUs with SYCL backend	2024-05-28 16:30:50 -07:00
Jeffrey Morgan	1f5008544b	Update install.sh	2024-05-28 15:01:22 -07:00
Jeffrey Morgan	45cbfc5aee	fix wsl2 status check for nvidia cards (#4689)	2024-05-28 14:49:46 -07:00
Jeffrey Morgan	6d423b383b	Improve install experience on WSL2 and Linux (#4653)	2024-05-28 14:41:50 -07:00
Josh	ad897080a2	working on integration of multi-byte and multi-width runes (#4549) * integrated runewidth for display management - fixed cursor movement for multi-width char * updated input and deletion of multi-byte chars * fixed line history with some exceptions * improved insert and add * fixed issues with moving across lines * end of line extra space tracking * saved changes * fixed end of line issues with empty spaces * worked some more * worked on end of line * fixed failed test * fixed minor inserting bug * fixed movement hotkeys * adjusted hotkeys * removed comments * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> * deleted comments and duplicate code * removed duplicate code * added comments, refactored add function to use addChar * added helper to retrieve lineSpacing, renamed lineFlags for clarity * fixed remove() --------- Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>	2024-05-28 12:04:03 -07:00
Jeffrey Morgan	b7d316d98d	fix nvidia detection in install script (#4683)	2024-05-28 09:59:36 -07:00
Daniel Hiltgen	d7339fad52	Merge pull request #4682 from dhiltgen/more_time Give the final model loading more time	2024-05-28 09:36:02 -07:00
Daniel Hiltgen	92c81e8117	Give the final model loading more time On some systems, 1 minute isn't sufficient to finish the load after it hits 100%. This creates 2 distinct timers, although they're both set to the same value for now so we can refine the timeouts further.	2024-05-28 09:08:10 -07:00
Tai	9db0996ed4	Add OllamaSpring Project to Readme (#4672) * Add OllamaSpring Project to Readme * Update README.md --------- Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-05-27 19:58:26 -07:00
Orfeo Ciano	6f43898b17	Adds olpaka flutter client (#4647) * Adds olpaka flutter client * Update README.md --------- Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-05-27 17:22:01 -07:00
Lei Jitang	7487229c34	llm/server.go: Fix 2 minor typos (#4661) Signed-off-by: Lei Jitang <leijitang@outlook.com>	2024-05-27 17:21:10 -07:00
Rayan Mostovoi	8a8e7afa96	small fix on examples/python-simplechat/client.py to actually get a streamed response and get tokens printed as we receive it (#4671)	2024-05-27 17:19:20 -07:00
Jeffrey Morgan	c79f8c9c39	Ensure `nvidia` and `nvidia_uvm` kernel modules are loaded in `install.sh` script and at startup (#4652) * ensure kernel modules are loaded in `install.sh` script and at startup * indentation * use `SUDO` variable * restart if nouveau is detected * consistent success message for AMD	2024-05-26 14:57:17 -07:00
Jeffrey Morgan	485016bfbb	Update install.sh	2024-05-26 11:46:00 -07:00


2849 commits