Commit graph

  • 9009bedf13 better checking for OLLAMA_HOST variable (#3661) Patrick Devine 2024-04-29 19:14:07 -04:00
  • d4ac57e240 Merge pull request #4035 from dhiltgen/fix_relative_paths Daniel Hiltgen 2024-04-29 16:08:06 -07:00
  • 7b59d1770f Fix relative path lookup Daniel Hiltgen 2024-04-29 16:00:08 -07:00
  • 95ead8ffba Restart server on failure when running Windows app (#3985) Jeffrey Morgan 2024-04-29 10:07:52 -04:00
  • 7aa08a77ca llm: dont cap context window limit to training context window (#3988) Jeffrey Morgan 2024-04-29 10:07:30 -04:00
  • 7e432cdfac types/model: remove old comment (#4020) Blake Mizerany 2024-04-28 20:52:26 -07:00
  • 586672f490 fix copying model to itself (#4019) Jeffrey Morgan 2024-04-28 23:47:49 -04:00
  • b03408de74 Merge pull request #3972 from hmartinez82/win_arm64 Daniel Hiltgen 2024-04-28 14:52:58 -07:00
  • 1e6a28bf5b Merge pull request #4009 from dhiltgen/cpu_concurrency Daniel Hiltgen 2024-04-28 14:20:27 -07:00
  • d6e3b64582 Fix concurrency for CPU mode Daniel Hiltgen 2024-04-28 13:40:31 -07:00
  • 114c932a8e types/model: allow _ as starter character in Name parts (#3991) Blake Mizerany 2024-04-27 21:24:52 -07:00
  • 7f7103de06 mac: update setup command to llama3 (#3986) Jeffrey Morgan 2024-04-27 22:52:10 -04:00
  • c631a9c726 types/model: relax name length constraint from 2 to 1 (#3984) Blake Mizerany 2024-04-27 17:58:41 -07:00
  • 8fd9e56804 types/structs: drop unused structs package (#3981) Blake Mizerany 2024-04-27 14:06:11 -07:00
  • 8a65717f55 Do not build AVX runners on ARM64 Hernan Martinez 2024-04-26 23:41:23 -06:00
  • 6d3152a98a Use architecture specific folders in installer script Hernan Martinez 2024-04-26 23:35:16 -06:00
  • b438d485f1 Use architecture specific folders in the generate script Hernan Martinez 2024-04-26 23:34:12 -06:00
  • 204349b17b Use architecture specific folders in the build script Hernan Martinez 2024-04-26 23:26:03 -06:00
  • 86e67fc4a9 Add import declaration for windows,arm64 to llm.go Hernan Martinez 2024-04-26 22:24:53 -06:00
  • 2bed62926e types/model: remove Digest (for now) (#3970) Blake Mizerany 2024-04-26 21:14:28 -07:00
  • aad8d128a0 also look at cwd as a root for windows runners (#3959) Jeffrey Morgan 2024-04-26 19:14:08 -04:00
  • ec1acbb867 Merge pull request #3968 from dhiltgen/win_generate Daniel Hiltgen 2024-04-26 16:03:38 -07:00
  • e4859c4563 Fine grain control over windows generate steps Daniel Hiltgen 2024-04-26 15:36:34 -07:00
  • 8e30eb26bd Updates the setup command to use llama3. (#3962) Nataly Merezhuk 2024-04-26 18:41:01 -04:00
  • 0b5c589ca2 Merge pull request #3966 from dhiltgen/bump Daniel Hiltgen 2024-04-26 15:36:53 -07:00
  • 65fadddc85 Merge pull request #3964 from ollama/mxyng/weights Michael Yang 2024-04-26 15:23:33 -07:00
  • ed5fb088c4 Fix target in gen_windows.ps1 Daniel Hiltgen 2024-04-26 15:10:42 -07:00
  • f81f308118 fix gemma, command-r layer weights Michael Yang 2024-04-26 15:00:54 -07:00
  • b1390a7b37 types/model: export ParseNameBare and Merge (#3957) Blake Mizerany 2024-04-26 14:58:07 -07:00
  • 11d83386a5 Merge pull request #3951 from ollama/mxyng/zip Michael Yang 2024-04-26 14:51:23 -07:00
  • bb31def011 return code 499 when user cancels request while a model is loading (#3955) Jeffrey Morgan 2024-04-26 17:38:29 -04:00
  • 41e03ede95 check file type before zip Michael Yang 2024-04-25 14:41:30 -07:00
  • 7fea1ecdf6 Merge pull request #3958 from ollama/mxyng/fix-workflow Michael Yang 2024-04-26 14:17:56 -07:00
  • 054894271d .github/workflows/test.yaml: add in-flight cancellations on new push (#3956) Blake Mizerany 2024-04-26 13:54:24 -07:00
  • 6fef042f0b use merge base for diff-tree Michael Yang 2024-04-26 13:54:13 -07:00
  • 5c0c2d1d09 Merge pull request #3954 from dhiltgen/ci_fixes Daniel Hiltgen 2024-04-26 13:09:03 -07:00
  • 37f9c8ad99 types/model: overhaul Name and Digest types (#3924) Blake Mizerany 2024-04-26 13:08:32 -07:00
  • 2a80f55e2a Update windows.md (#3855) Quinten van Buul 2024-04-26 22:04:15 +02:00
  • 421c878a2d Put back non-avx CPU build for windows Daniel Hiltgen 2024-04-26 12:44:07 -07:00
  • 36666c2142 Merge pull request #3925 from dhiltgen/bump Daniel Hiltgen 2024-04-26 10:09:38 -07:00
  • 85801317d1 Fix clip log import Daniel Hiltgen 2024-04-26 09:43:46 -07:00
  • 2ed0d65948 Bump llama.cpp to b2737 Daniel Hiltgen 2024-04-25 17:15:24 -07:00
  • d459dc4ad1 Merge pull request #3950 from dhiltgen/windows_packaging Daniel Hiltgen 2024-04-26 09:27:37 -07:00
  • 40bc4622ef Fix exe name for zip packaging on windows Daniel Hiltgen 2024-04-26 09:16:53 -07:00
  • c0f818a07a Merge pull request #3948 from dhiltgen/win_generate Daniel Hiltgen 2024-04-26 09:17:20 -07:00
  • 8671fdeda6 Refactor windows generate for more modular usage Daniel Hiltgen 2024-04-25 21:41:33 -07:00
  • 2619850fb4 Merge pull request #3933 from dhiltgen/ci_fixes Daniel Hiltgen 2024-04-26 07:01:24 -07:00
  • 8feb97dc0d Move cuda/rocm dependency gathering into generate script Daniel Hiltgen 2024-04-25 22:02:10 -07:00
  • 4e1ff6dcbb Merge pull request #3926 from dhiltgen/ci_fixes Daniel Hiltgen 2024-04-25 17:42:31 -07:00
  • 8589d752ac Fix release CI Daniel Hiltgen 2024-04-25 17:27:11 -07:00
  • de4ded68b0 Merge pull request #3923 from ollama/mxyng/mem Michael Yang 2024-04-25 16:34:17 -07:00
  • 9b5a3c5991 Merge pull request #3914 from dhiltgen/mac_perf Daniel Hiltgen 2024-04-25 16:28:31 -07:00
  • 00b0699c75 Reload model if num_gpu changes (#3920) Jeffrey Morgan 2024-04-25 19:02:40 -04:00
  • 993cf8bf55 llm: limit generation to 10x context size to avoid run on generations (#3918) Jeffrey Morgan 2024-04-25 19:02:30 -04:00
  • 7bb7cb8a60 only count output tensors Michael Yang 2024-04-25 14:41:50 -07:00
  • b123be5b71 Adjust context size for parallelism Daniel Hiltgen 2024-04-25 09:38:31 -07:00
  • ddf5c09a9b use matrix multiplcation kernels in more cases jmorganca 2024-04-25 00:33:33 -04:00
  • 5f73c08729 Remove trailing spaces (#3889) Roy Yang 2024-04-25 11:32:26 -07:00
  • f503a848c2 Merge pull request #3895 from brycereitano/shiftloading Daniel Hiltgen 2024-04-25 09:24:08 -07:00
  • 36a6daccab Restructure loading conditional chain Bryce Reitano 2024-04-24 17:37:03 -06:00
  • ceb0e26e5e Provide variable ggml for TestLoad Bryce Reitano 2024-04-24 17:19:55 -06:00
  • 284e02bed0 Move ggml loading to when we attempt fitting Bryce Reitano 2024-04-24 17:17:24 -06:00
  • 3450a57d4a Merge pull request #3713 from ollama/mxyng/modelname Michael Yang 2024-04-24 16:00:32 -07:00
  • 592dae31c8 update copy to use model.Name Michael Yang 2024-04-16 16:22:38 -07:00
  • 2010cbc5fa Merge pull request #3833 from ollama/mxyng/fix-from Michael Yang 2024-04-24 15:13:47 -07:00
  • ac0801eced only replace if it matches command Michael Yang 2024-04-24 14:27:12 -07:00
  • ad66e5b060 split temp zip files Michael Yang 2024-04-22 11:02:25 -07:00
  • ade4b55520 types/model: make ParseName use default without question (#3886) Blake Mizerany 2024-04-24 11:52:55 -07:00
  • a6d62e0617 Merge pull request #3882 from dhiltgen/amd_gfx Daniel Hiltgen 2024-04-24 11:07:49 -07:00
  • 6e76348df7 Merge pull request #3834 from dhiltgen/not_found_in_path Daniel Hiltgen 2024-04-24 10:50:48 -07:00
  • 0d6687f84c AMD gfx patch rev is hex Daniel Hiltgen 2024-04-24 09:43:52 -07:00
  • 74d2a9ef9a add OLLAMA_KEEP_ALIVE env variable to FAQ (#3865) Patrick Devine 2024-04-23 21:06:51 -07:00
  • 14476d48cc fixes for gguf (#3863) Patrick Devine 2024-04-23 20:57:20 -07:00
  • ce8ce82567 add mixtral 8x7b model conversion (#3859) Patrick Devine 2024-04-23 20:17:04 -07:00
  • 4dc4f1be34 types/model: restrict digest hash part to a minimum of 2 characters (#3858) Blake Mizerany 2024-04-23 18:24:17 -07:00
  • 16b52331a4 Merge pull request #3857 from dhiltgen/mem_escape_valve Daniel Hiltgen 2024-04-23 17:32:24 -07:00
  • 5445aaa94e Add back memory escape valve Daniel Hiltgen 2024-04-23 17:09:02 -07:00
  • 2ac3dd6853 Merge pull request #3850 from dhiltgen/windows_packaging Daniel Hiltgen 2024-04-23 16:35:20 -07:00
  • d8851cb7a0 Harden sched TestLoad Daniel Hiltgen 2024-04-23 13:07:16 -07:00
  • 058f6cd2cc Move nested payloads to installer and zip file on windows Daniel Hiltgen 2024-04-23 12:19:17 -07:00
  • 790cf34d17 Merge pull request #3846 from dhiltgen/missing_runner Daniel Hiltgen 2024-04-23 13:14:12 -07:00
  • 928d844896 adding phi-3 mini to readme Michael 2024-04-23 13:58:31 -04:00
  • 939d6a8606 Make CI lint verbvose Daniel Hiltgen 2024-04-23 10:17:42 -07:00
  • 58888a74bc Detect and recover if runner removed Daniel Hiltgen 2024-04-23 10:05:26 -07:00
  • cc5a71e0e3 Merge pull request #3709 from remy415/custom-gpu-defs Daniel Hiltgen 2024-04-23 09:28:34 -07:00
  • e83bcf7f9a Merge pull request #3836 from ollama/mxyng/mixtral Michael Yang 2024-04-23 09:15:10 -07:00
  • 5690e5ce99 Merge pull request #3418 from dhiltgen/concurrency Daniel Hiltgen 2024-04-23 08:31:38 -07:00
  • f2ea8470e5 Local unicode test case Daniel Hiltgen 2024-04-16 13:42:52 -07:00
  • 34b9db5afc Request and model concurrency Daniel Hiltgen 2024-03-30 09:50:05 -07:00
  • 8711d03df7 Report errors on server lookup instead of path lookup failure Daniel Hiltgen 2024-04-22 16:22:05 -07:00
  • ee448deaba Merge pull request #3835 from dhiltgen/harden_llm_override Daniel Hiltgen 2024-04-22 19:06:54 -07:00
  • 6e8db04716 tidy community integrations Bruce MacDonald 2024-04-22 17:29:08 -07:00
  • 658e60cf73 Revert "stop running model on interactive exit" Bruce MacDonald 2024-04-22 17:23:11 -07:00
  • 4c78f028f8 Merge branch 'main' of https://github.com/ollama/ollama Bruce MacDonald 2024-04-22 17:22:28 -07:00
  • 435cc866a3 fix: mixtral graph Michael Yang 2024-04-22 16:57:05 -07:00
  • c7d3a558f6 docs: update README to add chat (web UI) for LLM (#3810) Hao Wu 2024-04-23 08:19:39 +08:00
  • 089cdb2877 docs: Update README for Lobe-chat integration. (#3817) Maple Gao 2024-04-23 08:18:15 +08:00
  • ea1e9aa36b Update README.md (#3655) Võ Đình Đạt 2024-04-23 07:16:55 +07:00
  • d0d28ef90d Update README.md with Discord-Ollama project (#3633) Jonathan Smoley 2024-04-22 17:14:20 -07:00
  • 6654186a7c Add podman-ollama to terminal apps (#3626) Eric Curtin 2024-04-23 01:13:23 +01:00