Commit graph

  • 298c996e54 added IsValidNamespace function Josh Yan 2024-05-30 16:02:07 -0700
  • 0fc0cfc6d2
    Merge pull request #4594 from dhiltgen/doc_container_workarounds Daniel Hiltgen 2024-05-30 13:10:54 -0700
  • 914f68f021 replaced duplicate call with variable Josh Yan 2024-05-30 10:38:07 -0700
  • bd1d119ba9 fixed japanese characters deleted at end of line Josh Yan 2024-05-30 10:24:21 -0700
  • a03be18189
    Fix OLLAMA_LLM_LIBRARY with wrong map name and add more env vars to help message (#4663) Lei Jitang 2024-05-31 00:36:51 +0800
  • 96bc232b43
    Merge pull request #4413 from ollama/mxyng/name-check Michael Yang 2024-05-29 12:06:58 -0700
  • bca7b12284
    Merge pull request #3718 from ollama/mxyng/modelname-3 Michael Yang 2024-05-29 12:02:07 -0700
  • 32cb1960c1
    Merge pull request #4380 from ollama/mxyng/tokenize Michael Yang 2024-05-29 12:00:59 -0700
  • de781b37c8 rm unused infill Michael Yang 2024-05-12 09:21:35 -0700
  • 3e21799377 rm unused system prompt Michael Yang 2024-05-12 09:20:39 -0700
  • 26a00a0410 use ffi for tokenizing/detokenizing Michael Yang 2024-05-11 12:49:24 -0700
  • 646371f56d
    Merge pull request #3278 from zhewang1-intc/rebase_ollama_main Daniel Hiltgen 2024-05-28 16:30:50 -0700
  • 1f5008544b
    Update install.sh Jeffrey Morgan 2024-05-28 15:01:22 -0700
  • 45cbfc5aee
    fix wsl2 status check for nvidia cards (#4689) Jeffrey Morgan 2024-05-28 14:49:46 -0700
  • 6d423b383b
    Improve install experience on WSL2 and Linux (#4653) Jeffrey Morgan 2024-05-28 14:41:50 -0700
  • ad897080a2
    working on integration of multi-byte and multi-width runes (#4549) Josh 2024-05-28 12:04:03 -0700
  • b7d316d98d
    fix nvidia detection in install script (#4683) Jeffrey Morgan 2024-05-28 09:59:36 -0700
  • d7339fad52
    Merge pull request #4682 from dhiltgen/more_time Daniel Hiltgen 2024-05-28 09:36:02 -0700
  • 92c81e8117 Give the final model loading more time Daniel Hiltgen 2024-05-28 08:56:18 -0700
  • 9db0996ed4
    Add OllamaSpring Project to Readme (#4672) Tai 2024-05-28 10:58:26 +0800
  • 6f43898b17
    Adds olpaka flutter client (#4647) Orfeo Ciano 2024-05-28 01:22:01 +0100
  • 7487229c34
    llm/server.go: Fix 2 minor typos (#4661) Lei Jitang 2024-05-28 08:21:10 +0800
  • 8a8e7afa96
    small fix on examples/python-simplechat/client.py to actually get a streamed response and get tokens printed as we receive it (#4671) Rayan Mostovoi 2024-05-28 02:19:20 +0200
  • c79f8c9c39
    Ensure nvidia and nvidia_uvm kernel modules are loaded in install.sh script and at startup (#4652) Jeffrey Morgan 2024-05-26 14:57:17 -0700
  • 485016bfbb
    Update install.sh Jeffrey Morgan 2024-05-26 11:46:00 -0700
  • 0165ba1651
    Merge pull request #4638 from dhiltgen/better_error Daniel Hiltgen 2024-05-25 14:32:28 -0700
  • c4209d6d21 Report better warning on client closed abort of load Daniel Hiltgen 2024-05-25 09:23:28 -0700
  • 6adca97f37
    Merge pull request #4619 from noxer/patch-1 Michael Yang 2024-05-24 17:21:57 -0700
  • 9a3c8003c8
    Merge pull request #4624 from ollama/mxyng/fix-5 Michael Yang 2024-05-24 16:11:21 -0700
  • d51f15257c
    Update llm/ggml.go Michael Yang 2024-05-24 16:10:43 -0700
  • 8f440d579a fix q5_0, q5_1 Michael Yang 2024-05-24 16:01:37 -0700
  • 4cc3be3035
    Move envconfig and consolidate env vars (#4608) Patrick Devine 2024-05-24 14:57:15 -0700
  • db2ffa79f1
    Fix download retry issue Tim Scheuermann 2024-05-24 20:30:42 +0200
  • afd2b058b4
    set codesign timeout to longer (#4605) Jeffrey Morgan 2024-05-23 22:46:23 -0700
  • fd5971be0b support ollama run on Intel GPUs Wang,Zhe 2024-05-24 11:18:27 +0800
  • 89bf98bcf2
    Merge pull request #4598 from dhiltgen/docs Daniel Hiltgen 2024-05-23 15:14:29 -0700
  • 1b2d156094 Tidy up developer guide a little Daniel Hiltgen 2024-05-23 14:24:07 -0700
  • 714adb8bd1
    bump (#4597) Michael Yang 2024-05-23 14:16:26 -0700
  • 95b1133d0c
    Merge pull request #4547 from dhiltgen/load_progress Daniel Hiltgen 2024-05-23 14:06:02 -0700
  • b37b496a12 Wire up load progress Daniel Hiltgen 2024-05-20 16:41:43 -0700
  • d6f692ad1a
    Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322) Bruce MacDonald 2024-05-23 13:21:49 -0700
  • f77713bf1f Add isolated gpu test to troubleshooting Daniel Hiltgen 2024-05-23 09:33:25 -0700
  • 38255d2af1
    Use flash attention flag for now (#4580) Jeffrey Morgan 2024-05-22 21:52:09 -0700
  • 73630a7e85
    add phi 3 medium (#4578) Michael 2024-05-22 12:53:45 -0400
  • 955c317cab
    chore: update tokenizer.go (#4571) Ikko Eltociear Ashimine 2024-05-22 16:25:23 +0900
  • 9f18b88a06
    Merge pull request #4566 from ollama/jyan/shortcuts Josh 2024-05-21 22:49:36 -0700
  • 353f83a9c7 add Ctrl + W shortcut Josh Yan 2024-05-21 16:55:09 -0700
  • 3bade04e10
    doc updates for the faq/troubleshooting (#4565) Patrick Devine 2024-05-21 15:30:09 -0700
  • a6d0f443eb
    Merge pull request #4543 from ollama/mxyng/simple-safetensors Michael Yang 2024-05-21 14:43:55 -0700
  • 96236b7968
    Merge pull request #4268 from ollama/pdevine/llama3 Michael Yang 2024-05-21 14:43:37 -0700
  • 4434d7f447
    Correct typo in error message (#4535) Sang Park 2024-05-22 05:39:01 +0900
  • 171eb040fc simplify safetensors reading Michael Yang 2024-05-20 09:47:01 -0700
  • 3591bbe56f add test Michael Yang 2024-05-21 11:28:16 -0700
  • 34d5ef29b3 fix conversion for f16 or f32 inputs Michael Yang 2024-05-17 12:11:49 -0700
  • bbbd9f20f3 cleanup Michael Yang 2024-05-15 14:55:57 -0700
  • 547132e820 bpe pretokenizer Michael Yang 2024-05-15 11:53:14 -0700
  • 2d315ba9a9 add missing file Patrick Devine 2024-05-08 16:56:18 -0700
  • d355d2020f add fixes for llama Patrick Devine 2024-05-08 16:07:46 -0700
  • c8cf0d94ed llama3 conversion Patrick Devine 2024-04-28 10:36:38 -0700
  • 4730762e5c add safetensors version Patrick Devine 2024-04-24 18:32:01 -0700
  • d88582dffd some changes for llama3 Patrick Devine 2024-04-18 16:00:20 -0700
  • 2f81b3dce2
    Merge pull request #4502 from ollama/mxyng/fix-quantize Michael Yang 2024-05-20 16:09:27 -0700
  • 5cab13739e set llama.cpp submodule commit to 614d3b9 jmorganca 2024-05-20 15:28:17 -0700
  • 8aadad9c72 updated updateURL Josh Yan 2024-05-20 15:24:32 -0700
  • 807d092761 fix quantize file types Michael Yang 2024-05-17 11:29:04 -0700
  • f36f1d6be9 tidy intermediate blobs Michael Yang 2024-05-20 14:58:27 -0700
  • 8800c8a59b
    chore: fix typo in docs (#4536) alwqx 2024-05-21 05:19:03 +0800
  • b4dce13309
    Merge pull request #4330 from ollama/mxyng/cache-intermediate-layers Michael Yang 2024-05-20 13:54:41 -0700
  • e15307fdf4
    feat: add support for flash_attn (#4120) Sam 2024-05-21 06:36:03 +1000
  • 3520c0e4d5 cache and reuse intermediate blobs Michael Yang 2024-05-10 15:48:41 -0700
  • ccdf0b2a44
    Move the parser back + handle utf16 files (#4533) Patrick Devine 2024-05-20 11:26:45 -0700
  • 63a453554d go mod tidy jmorganca 2024-05-19 23:03:57 -0700
  • 105186aa17
    add OLLAMA_NOHISTORY to turn off history in interactive mode (#4508) Patrick Devine 2024-05-18 11:51:57 -0700
  • ba04afc9a4
    Merge pull request #4483 from dhiltgen/clean_exit Daniel Hiltgen 2024-05-17 11:41:57 -0700
  • 7e1e0086e7
    Merge pull request #4482 from dhiltgen/integration_improvements Daniel Hiltgen 2024-05-16 16:43:48 -0700
  • 02b31c9dc8 Don't return error on signal exit Daniel Hiltgen 2024-05-16 16:25:38 -0700
  • 7f2fbad736 Skip max queue test on remote Daniel Hiltgen 2024-05-16 16:24:18 -0700
  • 5bece94509
    Merge pull request #4463 from ollama/jyan/line-display Josh 2024-05-16 14:15:08 -0700
  • 3d90156e99 removed comment Josh Yan 2024-05-16 14:12:03 -0700
  • 5e46c5c435
    Updating software for read me (#4467) Rose Heart 2024-05-16 15:55:14 -0500
  • 583c1f472c
    update llama.cpp submodule to 614d3b9 (#4414) Jeffrey Morgan 2024-05-16 13:53:09 -0700
  • 26bfc1c443 go fmt'd cmd.go Josh Yan 2024-05-15 17:26:39 -0700
  • 799aa9883c go fmt'd cmd.go Josh Yan 2024-05-15 17:24:17 -0700
  • 84ed77cbd8
    Merge pull request #4436 from ollama/mxyng/done-part Michael Yang 2024-05-15 17:16:24 -0700
  • c9e584fb90 updated double-width display Josh Yan 2024-05-15 16:45:24 -0700
  • 17b1e81ca1 fixed width and word count for double spacing Josh Yan 2024-05-15 16:29:33 -0700
  • 7e9a2da097
    Merge pull request #4462 from dhiltgen/opt_out_build Daniel Hiltgen 2024-05-15 16:27:47 -0700
  • c48c1d7c46 Port cuda/rocm skip build vars to linux Daniel Hiltgen 2024-05-15 15:56:43 -0700
  • d1692fd3e0
    fix the cpu estimatedTotal memory + get the expiry time for loading models (#4461) Patrick Devine 2024-05-15 15:43:16 -0700
  • 5fa36a0833
    Merge pull request #4459 from dhiltgen/sanitize_env_log Daniel Hiltgen 2024-05-15 14:58:55 -0700
  • 853ae490e1 Sanitize the env var debug log Daniel Hiltgen 2024-05-15 14:42:57 -0700
  • f2cf97d6f1
    fix typo in modelfile generation (#4439) Patrick Devine 2024-05-14 15:34:29 -0700
  • c344da4c5a
    fix keepalive for non-interactive mode (#4438) Patrick Devine 2024-05-14 15:17:04 -0700
  • 85a57006d1 check if name exists before create/pull/copy Michael Yang 2024-05-13 15:27:51 -0700
  • c5e892cb3e update tests Michael Yang 2024-05-13 14:41:37 -0700
  • 81fb06f530 more resilient Manifests Michael Yang 2024-05-09 10:00:18 -0700
  • a385382ff5 filepath.Join Michael Yang 2024-05-08 15:56:40 -0700
  • b8772a353f remove DeleteModel Michael Yang 2024-05-08 14:54:52 -0700
  • c2714fcbfd routes: use Manifests for ListHandler Michael Yang 2024-05-06 16:34:13 -0700
  • a2fc933fed update delete handler to use model.Name Michael Yang 2024-04-17 17:23:19 -0700