Commit graph

  • e4b76dfb76
    docs: Add AnythingLLM to README as integration option (#3145) Timothy Carambat 2024-03-25 11:54:48 -07:00
  • 2c56517494
    Add Saddle (#3178) Jikku Jose 2024-03-26 00:24:09 +05:30
  • cfbc1b152b
    tlm added to README.md terminal section. (#3274) Yusuf Can Bayrak 2024-03-25 19:53:26 +01:00
  • 9305ac1b2e
    Update README.md (#3288) RAPID ARCHITECT 2024-03-25 13:52:25 -05:00
  • 45d6292959
    Update README.md (#3338) drazdra 2024-03-26 00:50:51 +06:00
  • 22921a3969
    doc: specify ADAPTER is optional (#3333) Blake Mizerany 2024-03-25 09:43:19 -07:00
  • 7b6cbc10ec Integration tests conditionally pull Daniel Hiltgen 2024-03-24 16:22:38 -07:00
  • dfc6721b20 add support for libcudart.so for CUDA devices (adds Jetson support) Jeremy 2024-03-25 11:07:44 -04:00
  • acfa2b9422
    llm: prevent race appending to slice (#3320) Blake Mizerany 2024-03-24 11:35:54 -07:00
  • 2c390a73ac
    Merge pull request #3282 from dhiltgen/gpu_docs Daniel Hiltgen 2024-03-24 19:15:03 +01:00
  • 3e30c75f3e Bump llama.cpp to b2510 Daniel Hiltgen 2024-03-23 12:16:06 +01:00
  • 7e430ff352
    Add Testcontainers into Libraries section (#3291) Eddú Meléndez Gonzales 2024-03-23 13:55:25 -05:00
  • 1784113ef5
    Merge pull request #3309 from dhiltgen/integration_testing Daniel Hiltgen 2024-03-23 19:08:49 +01:00
  • 949b6c01e0 Revamp go based integration tests Daniel Hiltgen 2024-03-23 14:24:18 +01:00
  • 38daf0a252 rename .gitattributes jmorganca 2024-03-23 12:40:31 +01:00
  • 43799532c1 Bump llama.cpp to b2474 Daniel Hiltgen 2024-03-23 09:54:56 +01:00
  • d8fdbfd8da Add docs for GPU selection and nvidia uvm workaround Daniel Hiltgen 2024-03-21 11:17:19 +01:00
  • a5ba0fcf78
    doc: faq gpu compatibility (#3142) Bruce MacDonald 2024-03-21 05:21:34 -04:00
  • 3a30bf56dc
    Update faq.md Jeffrey Morgan 2024-03-20 17:48:39 +01:00
  • a1c0a48524
    Merge pull request #3122 from dhiltgen/better_tmp_cleanup Daniel Hiltgen 2024-03-20 16:28:03 +01:00
  • 74788b487c Better tmpdir cleanup Daniel Hiltgen 2024-03-13 11:43:45 -07:00
  • 7ed3e94105
    Update faq.md Jeffrey Morgan 2024-03-18 10:24:39 +01:00
  • 2297ad39da update faq.md jmorganca 2024-03-16 16:44:01 -04:00
  • 01cff6136d
    Merge pull request #3217 from ollama/mxyng/cleanup Michael Yang 2024-03-18 02:13:30 -07:00
  • 3c4ad0ecab dyn global Michael Yang 2024-03-15 16:34:38 -07:00
  • 22f326464e
    Merge pull request #3083 from ollama/mxyng/refactor-readseeker Michael Yang 2024-03-16 12:08:56 -07:00
  • e95ffc7448
    llama: remove server static assets (#3174) Jeffrey Morgan 2024-03-15 19:24:12 -07:00
  • 2dce1ab40b
    add llm/ext_server directory to linguist-vendored (#3173) Jeffrey Morgan 2024-03-15 17:46:46 -07:00
  • f4b31c2d53
    Merge pull request #3111 from alitrack/main Daniel Hiltgen 2024-03-15 16:46:59 -07:00
  • ab3456207b
    Merge pull request #3028 from ollama/ci_release Daniel Hiltgen 2024-03-15 16:40:54 -07:00
  • 6ad414f31e
    Merge pull request #3086 from dhiltgen/import_server Daniel Hiltgen 2024-03-15 16:10:35 -07:00
  • 052b5a3b77
    Merge pull request #3171 from dhiltgen/rocm_94x Daniel Hiltgen 2024-03-15 15:58:33 -07:00
  • d4c10df2b0 Add Radeon gfx940-942 GPU support Daniel Hiltgen 2024-03-15 15:34:58 -07:00
  • 540f4af45f Wire up more complete CI for releases Daniel Hiltgen 2024-03-07 10:54:21 -08:00
  • 6ce37e4d96
    llm,readline: use errors.Is instead of simple == check (#3161) Blake Mizerany 2024-03-15 07:14:12 -07:00
  • 703684a82a
    server: replace blob prefix separator from ':' to '-' (#3146) Blake Mizerany 2024-03-14 20:18:06 -07:00
  • 6459377ae0
    Add ROCm support to linux install script (#2966) Daniel Hiltgen 2024-03-14 18:00:16 -07:00
  • 8546dd3d72
    .github: fix model and feature request yml (#3155) Blake Mizerany 2024-03-14 15:26:06 -07:00
  • 87100be5e0
    .github: add issue templates (#3143) Blake Mizerany 2024-03-14 15:19:10 -07:00
  • e87c780ff9
    Merge pull request #3149 from ollama/mxyng/fix-memory-leak Michael Yang 2024-03-14 13:34:15 -07:00
  • 291c663865 fix: clip memory leak Michael Yang 2024-03-14 12:45:46 -07:00
  • da20786e3e
    Merge pull request #3068 from dhiltgen/win_pipe Daniel Hiltgen 2024-03-14 11:55:19 -07:00
  • 5ce997a7b9
    Update README.md Jeffrey Morgan 2024-03-13 21:12:17 -07:00
  • 672ffe9b7d
    add OLLAMA_KEEP_ALIVE to environment variable docs for ollama serve (#3127) Jeffrey Morgan 2024-03-13 14:35:33 -07:00
  • 47cfe58af5
    Default Keep Alive environment variable (#3094) Patrick Devine 2024-03-13 13:29:40 -07:00
  • c1a81c6fe3 Use stdin for term discovery on windows Daniel Hiltgen 2024-03-11 15:21:57 -07:00
  • 152ab524c2
    Update ollama.iss Steven Lee 2024-03-13 20:15:45 +08:00
  • e72c567cfd
    restore locale patch (#3091) Jeffrey Morgan 2024-03-12 22:08:13 -07:00
  • 3e22611200
    token repeat limit for prediction requests (#3080) Bruce MacDonald 2024-03-12 22:08:25 -04:00
  • a54d4a28dc
    Merge pull request #3088 from dhiltgen/rocm_igpu_linux Daniel Hiltgen 2024-03-12 17:20:27 -07:00
  • 82b0c7c27e Fix iGPU detection for linux Daniel Hiltgen 2024-03-12 16:57:19 -07:00
  • ba7cf7fb66
    add more docs on for the modelfile message command (#3087) Patrick Devine 2024-03-12 16:41:41 -07:00
  • 2f804068bd
    warn when json format is expected but not mentioned in prompt (#3081) Bruce MacDonald 2024-03-12 19:07:11 -04:00
  • 85129d3a32 Adapt our build for imported server.cpp Daniel Hiltgen 2024-03-12 13:51:44 -07:00
  • 9ac6440da3 Import server.cpp as of b2356 Daniel Hiltgen 2024-03-12 13:49:47 -07:00
  • 0085297928 refactor readseeker Michael Yang 2024-03-09 12:28:36 -08:00
  • 34d00f90b1
    Merge pull request #3070 from dhiltgen/visible_devices Daniel Hiltgen 2024-03-12 11:36:46 -07:00
  • b53229a2ed Add docs explaining GPU selection env vars Daniel Hiltgen 2024-03-11 16:54:38 -07:00
  • 53c107e20e
    chore: fix typo (#3073) racerole 2024-03-13 02:09:22 +08:00
  • 51578d8573
    fix gpu_info_cuda.c compile warning (#3077) mofanke 2024-03-13 02:08:40 +08:00
  • b5fcd9d3aa
    use -trimpath when building releases (#3069) Jeffrey Morgan 2024-03-11 15:58:46 -07:00
  • b80661e8c7
    relay load model errors to the client (#3065) Bruce MacDonald 2024-03-11 16:48:27 -04:00
  • 6d3adfbea2
    Update troubleshooting.md Jeffrey Morgan 2024-03-11 13:22:28 -07:00
  • 369eda65f5
    update llama.cpp submodule to ceca1ae (#3064) Jeffrey Morgan 2024-03-11 12:57:48 -07:00
  • f878e91070
    Merge pull request #3044 from ollama/mxyng/fix-convert-shape Michael Yang 2024-03-11 09:56:57 -07:00
  • 0d651478e4
    Merge pull request #3056 from dhiltgen/rocm_link_clash Daniel Hiltgen 2024-03-11 09:48:48 -07:00
  • 9ea492f1ce convert: fix shape Michael Yang 2024-03-10 10:41:40 -07:00
  • bc13da2bfe Avoid rocm runner and dependency clash Daniel Hiltgen 2024-03-11 08:45:57 -07:00
  • 41b00b9856 fix 03-locale.diff Jeffrey Morgan 2024-03-10 16:21:05 -07:00
  • c2a8ed48e7
    Merge pull request #3048 from dhiltgen/harden_rocm_deps Daniel Hiltgen 2024-03-10 15:17:22 -07:00
  • 3dc1bb6a35 Harden for deps file being empty (or short) Daniel Hiltgen 2024-03-10 14:45:38 -07:00
  • 7865a6996a
    Merge pull request #3046 from dhiltgen/rocm_search_paths Daniel Hiltgen 2024-03-10 12:30:56 -07:00
  • 00ec269321 Add ollama executable peer dir for rocm Daniel Hiltgen 2024-03-10 12:13:46 -07:00
  • 908005d90b
    patch: use default locale in wpm tokenizer (#3034) Jeffrey Morgan 2024-03-09 21:12:12 -08:00
  • cdf65e793f only copy deps for amd64 in build_linux.sh Jeffrey Morgan 2024-03-09 17:55:22 -08:00
  • 82ca694d68
    Rename ROCm deps file to avoid confusion (#3025) Daniel Hiltgen 2024-03-09 17:48:38 -08:00
  • 5017a15bcb add macapp to .dockerignore Jeffrey Morgan 2024-03-09 16:07:06 -08:00
  • e11668aa07 add bundle_metal and cleanup_metal funtions to gen_darwin.sh Jeffrey Morgan 2024-03-09 16:04:57 -08:00
  • 0bd0f4a29c tidy cleanup logs Jeffrey Morgan 2024-03-09 15:56:48 -08:00
  • 1ffb1e2874
    update llama.cpp submodule to 77d1ac7 (#3030) Jeffrey Morgan 2024-03-09 15:55:34 -08:00
  • 0a7844413c
    Merge pull request #3026 from dhiltgen/win_rocm_docs Daniel Hiltgen 2024-03-09 14:17:19 -08:00
  • f9cd55c70b disable gpu for certain model architectures and fix divide-by-zero on memory estimation Jeffrey Morgan 2024-03-09 12:51:38 -08:00
  • 0fdebb34a9 Doc how to set up ROCm builds on windows Daniel Hiltgen 2024-03-09 11:29:45 -08:00
  • ac64cd4ef9
    Merge pull request #3008 from dhiltgen/no_more_idempotent Daniel Hiltgen 2024-03-09 09:13:24 -08:00
  • 4a5c9b8035 Finish unwinding idempotent payload logic Daniel Hiltgen 2024-03-08 09:45:55 -08:00
  • efe5617b64
    update llama.cpp submodule to c2101a2 (#3020) Jeffrey Morgan 2024-03-09 00:44:50 -08:00
  • 5b3fad9636 separate out isLocalIP Jeffrey Morgan 2024-03-09 00:22:08 -08:00
  • bfec2c6e10 simplify host checks Jeffrey Morgan 2024-03-08 23:29:53 -08:00
  • 5c143af726 add additional allowed hosts Jeffrey Morgan 2024-03-08 23:23:59 -08:00
  • 6c0af2599e
    Update docs README.md and table of contents Jeffrey Morgan 2024-03-08 22:45:11 -08:00
  • fc8c044584
    add allowed host middleware and remove workDir middleware (#3018) Jeffrey Morgan 2024-03-08 22:23:47 -08:00
  • ecc133d843
    Merge pull request #3014 from ollama/mxyng/decode-ggla Michael Yang 2024-03-08 16:14:53 -08:00
  • 76bdebbadf decode ggla Michael Yang 2024-03-08 15:38:53 -08:00
  • 18979ad4a1 convert: fix default shape Michael Yang 2024-03-08 15:40:16 -08:00
  • 8e0ef931d8
    Merge pull request #2990 from ollama/mxyng/default-term-size Michael Yang 2024-03-08 15:20:54 -08:00
  • 280da44522
    Merge pull request #2988 from dhiltgen/rocm_docs Daniel Hiltgen 2024-03-08 13:33:30 -08:00
  • 0cebc79cba
    fix: allow importing a model from name reference (#3005) Bruce MacDonald 2024-03-08 12:27:47 -05:00
  • 0e4669b04f
    update llama.cpp submodule to 6cdabe6 (#2999) Jeffrey Morgan 2024-03-08 00:26:20 -08:00
  • b886bec3f9
    Update api.md Jeffrey Morgan 2024-03-07 23:27:51 -08:00
  • fc06205971
    Revert "adjust download and upload concurrency based on available bandwidth" (#2995) Jeffrey Morgan 2024-03-07 18:10:16 -08:00