ollama

Author	SHA1	Message	Date
Patrick Devine	d13c3daa0b	add safetensors to the modelfile docs (#6532)	2024-08-27 14:46:47 -07:00
Patrick Devine	1713eddcd0	Fix import image width (#6528)	2024-08-27 14:19:47 -07:00
Daniel Hiltgen	4e1c4f6e0b	Update manual instructions with discrete ROCm bundle (#6445)	2024-08-27 13:42:28 -07:00
Patrick Devine	1c70a00f71	adjust image sizes	2024-08-27 11:15:25 -07:00
Patrick Devine	ac80010db8	update the import docs (#6104)	2024-08-26 19:57:26 -07:00
Michael Yang	bb362caf88	update faq	2024-08-23 13:37:21 -07:00
Daniel Hiltgen	f9e31da946	Review comments	2024-08-19 10:36:15 -07:00
Daniel Hiltgen	88bb9e3328	Adjust layout to bin+lib/ollama	2024-08-19 09:38:53 -07:00
Bruce MacDonald	eda8a32a09	update chatml template format to latest in docs (#6344)	2024-08-13 16:39:18 -07:00
Pamela Fox	1f32276178	Update openai.md to remove extra checkbox (#6345)	2024-08-13 13:36:05 -07:00
Michael Yang	bd5e432630	update import.md	2024-08-12 15:13:29 -07:00
royjhan	5b3a21b578	add metrics to docs (#6079)	2024-08-07 14:43:44 -07:00
Kyle Kelley	ad0c19dde4	Use llama3.1 in tools example (#5985) * Use llama3.1 in tools example * Update api.md	2024-08-07 17:20:50 -04:00
Michael Yang	39f2bc6bfc	Merge pull request #6167 from ollama/mxyng/line-feed line feed	2024-08-05 00:06:28 -07:00
frob	b73b0940ef	Disable paging for journalctl (#6154) Users using `journalctl` to get logs for issue logging sometimes don't realize that paging is causing information to be missed.	2024-08-05 00:10:53 -04:00
Michael Yang	6a07344786	line feed	2024-08-04 17:25:41 -07:00
royjhan	4addf6b587	Update OpenAI Compatibility Docs with /v1/completions (#5311) * Update docs * token bug corrected * Update docs/openai.md * Update docs/openai.md * add suffix * merge conflicts * merge conflicts	2024-08-02 13:16:23 -07:00
royjhan	85c7f11170	Update docs (#5310)	2024-08-02 13:05:57 -07:00
Kim Hallberg	ce1fb4447e	Fix models/{model} URL (#6132)	2024-08-01 16:31:47 -07:00
royjhan	558a54b098	Update OpenAI Compatibility Docs with /v1/embeddings (#5470) * docs without usage * no usage * rm metric note	2024-08-01 16:00:29 -07:00
royjhan	ed52833bb1	Add to docs (#5309)	2024-08-01 15:58:13 -07:00
royjhan	f561eecfb8	Update OpenAI Compatibility Docs with /v1/models (#5151) * OpenAI Docs * Update docs/openai.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Remove newline --------- Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-08-01 15:48:44 -07:00
Daniel Hiltgen	1a83581a8e	Merge pull request #5895 from dhiltgen/sched_faq Better explain multi-gpu behavior	2024-07-29 14:25:41 -07:00
Daniel Hiltgen	161e12cecf	Merge pull request #5932 from dhiltgen/win_font Explain font problems on windows 10	2024-07-29 13:40:24 -07:00
Veit Heller	6f26e9322f	Fix typo in image docs (#6041)	2024-07-29 08:50:53 -07:00
Jeffrey Morgan	0e4d653687	upate to `llama3.1` elsewhere in repo (#6032)	2024-07-28 19:56:02 -07:00
Tibor Schmidt	f3d7a481b7	feat: add support for min_p (resolve #1142) (#1825)	2024-07-27 14:37:40 -07:00
Jeffrey Morgan	f5e3939220	Update api.md (#5968)	2024-07-25 23:10:18 -04:00
Jeffrey Morgan	ae27d9dcfd	Update openai.md	2024-07-25 20:27:33 -04:00
Michael Yang	37096790a7	Merge pull request #5552 from ollama/mxyng/messages-docs docs	2024-07-25 16:26:19 -07:00
Michael Yang	997c903884	Update docs/template.md Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-07-25 16:23:40 -07:00
Jeffrey Morgan	455e61170d	Update openai.md	2024-07-25 18:34:47 -04:00
royjhan	4de1370a9d	openai tools doc (#5617)	2024-07-25 18:34:06 -04:00
Daniel Hiltgen	6c2129d5d0	Explain font problems on windows 10	2024-07-24 15:22:00 -07:00
Daniel Hiltgen	830fdd2715	Better explain multi-gpu behavior	2024-07-23 15:16:38 -07:00
Michael Yang	9b60a038e5	update api.md	2024-07-22 13:49:51 -07:00
Michael Yang	83a0cb8d88	docs	2024-07-22 13:38:09 -07:00
royjhan	c0648233f2	api embed docs (#5282)	2024-07-22 13:37:08 -07:00
Daniel Hiltgen	283948c83b	Adjust windows ROCm discovery The v5 hip library returns unsupported GPUs which wont enumerate at inference time in the runner so this makes sure we align discovery. The gfx906 cards are no longer supported so we shouldn't compile with that GPU type as it wont enumerate at runtime.	2024-07-20 15:17:50 -07:00
royjhan	0d41623b52	OpenAI: Add Suffix to `v1/completions` (#5611) * add suffix * remove todo * remove TODO * add to test * rm outdated prompt tokens info md * fix test * fix test	2024-07-16 20:50:14 -07:00
Daniel Hiltgen	1f50356e8e	Bump ROCm on windows to 6.1.2 This also adjusts our algorithm to favor our bundled ROCm. I've confirmed VRAM reporting still doesn't work properly so we can't yet enable concurrency by default.	2024-07-10 11:01:22 -07:00
Jeffrey Morgan	8f8e736b13	update llama.cpp submodule to `d7fd29f` (#5475)	2024-07-05 13:25:58 -04:00
Daniel Hiltgen	52abc8acb7	Document older win10 terminal problems We haven't found a workaround, so for now recommend updating.	2024-07-03 17:32:14 -07:00
Daniel Hiltgen	ef757da2c9	Better nvidia GPU discovery logging Refine the way we log GPU discovery to improve the non-debug output, and report more actionable log messages when possible to help users troubleshoot on their own.	2024-07-03 10:50:40 -07:00
Daniel Hiltgen	d2f19024d0	Merge pull request #5442 from dhiltgen/concurrency_docs Add windows radeon concurrency note	2024-07-02 12:47:47 -07:00
Daniel Hiltgen	69c04eecc4	Add windows radeon concurreny note	2024-07-02 12:46:14 -07:00
royjhan	996bb1b85e	OpenAI: /v1/models and /v1/models/{model} compatibility (#5007) * OpenAI v1 models * Refactor Writers * Add Test Co-Authored-By: Attila Kerekes * Credit Co-Author Co-Authored-By: Attila Kerekes <439392+keriati@users.noreply.github.com> * Empty List Testing * Use Namespace for Ownedby * Update Test * Add back envconfig * v1/models docs * Use ModelName Parser * Test Names * Remove Docs * Clean Up * Test name Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Add Middleware for Chat and List * Testing Cleanup * Test with Fatal * Add functionality to chat test * OpenAI: /v1/models/{model} compatibility (#5028) * Retrieve Model * OpenAI Delete Model * Retrieve Middleware * Remove Delete from Branch * Update Test * Middleware Test File * Function name * Cleanup * Test Update * Test Update --------- Co-authored-by: Attila Kerekes <439392+keriati@users.noreply.github.com> Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-07-02 11:50:56 -07:00
Daniel Hiltgen	dfded7e075	Merge pull request #5364 from dhiltgen/concurrency_docs Document concurrent behavior and settings	2024-07-01 09:49:48 -07:00
Eduard	27402cb7a2	Update gpu.md (#5382) Runs fine on a NVIDIA GeForce GTX 1050 Ti	2024-06-30 21:48:51 -04:00
Jeffrey Morgan	c1218199cf	Update api.md	2024-06-29 16:22:49 -07:00

354 commits