Commit graph

  • 94e1d96b29
    Updated README section on community projects for table (#550) James Braza 2023-09-18 12:22:50 -0700
  • 66003e1d05
    subprocess improvements (#524) Bruce MacDonald 2023-09-18 15:16:32 -0400
  • c345053a8b
    Merge pull request #537 from jmorganca/mxyng/upload Michael Yang 2023-09-15 17:48:39 -0700
  • 08d7c2a944 fix error on upload chunk Michael Yang 2023-09-15 15:59:30 -0700
  • bc9573dcb1
    Merge pull request #530 from jmorganca/mxyng/progresswriter Michael Yang 2023-09-15 12:43:46 -0700
  • e53bc57d4d split uploadBlobChunked Michael Yang 2023-09-14 13:30:28 -0700
  • f0b398d17f implement ProgressWriter Michael Yang 2023-09-14 09:54:05 -0700
  • 8efbc5df55
    DRAFT: add a simple python client to access ollama (#522) Patrick Devine 2023-09-14 16:37:38 -0700
  • ccc3e9ac6d
    Merge pull request #531 from jmorganca/mxyng/content-length Michael Yang 2023-09-14 13:33:11 -0700
  • daa4f096f9 set request.ContentLength Michael Yang 2023-09-14 10:05:29 -0700
  • 3ee85f1c6c
    Merge pull request #526 from jmorganca/mxyng/cleanup Michael Yang 2023-09-14 13:10:59 -0700
  • 2540c9181c
    support for packaging in multiple cuda runners (#509) Bruce MacDonald 2023-09-14 15:08:13 -0400
  • 83ffb154bc
    Merge pull request #507 from jmorganca/mxyng/build Michael Yang 2023-09-14 11:25:59 -0700
  • 9aa192c812 update cuda docker image Michael Yang 2023-09-08 13:41:49 -0700
  • fc8707686f
    Update API docs (#527) Matt Williams 2023-09-14 08:51:26 -0700
  • f89c23764b
    Merge pull request #525 from jmorganca/mxyng/falcon-decode Michael Yang 2023-09-13 15:08:47 -0700
  • e6881cabd0 remove unused Michael Yang 2023-09-13 11:46:29 -0700
  • d028853879 fix: add falcon.go Michael Yang 2023-09-13 14:47:32 -0700
  • 949553db23
    Merge pull request #519 from jmorganca/mxyng/decode Michael Yang 2023-09-13 12:43:57 -0700
  • 0c5a454361 fix model type for 70b Michael Yang 2023-09-12 10:52:57 -0700
  • f59c4d03f7
    fix ggml arm64 cuda build (#520) Bruce MacDonald 2023-09-12 17:06:48 -0400
  • 7dee25a07f fix falcon decode Michael Yang 2023-09-12 10:01:20 -0700
  • f221637053
    first pass at linux gpu support (#454) Bruce MacDonald 2023-09-12 11:04:35 -0400
  • 45ac07cd02
    create the blobs directory correctly (#508) Patrick Devine 2023-09-11 14:54:52 -0700
  • 7d749cc787 fix darwin build script Jeffrey Morgan 2023-09-11 16:31:46 -0400
  • e7e91cd71c
    add autoprune to remove unused layers (#491) Patrick Devine 2023-09-11 11:46:35 -0700
  • 3920e15386
    add model format to config layer (#497) Jeffrey Morgan 2023-09-09 17:53:44 -0400
  • 41e976edde
    Merge pull request #492 from jmorganca/mxyng/nil-pointer Michael Yang 2023-09-07 17:25:23 -0700
  • de227b620f fix nil pointer dereference Michael Yang 2023-09-07 17:24:31 -0700
  • 63def6ca49
    Merge pull request #487 from jmorganca/mxyng/dockerignore Michael Yang 2023-09-07 14:16:17 -0700
  • 738fe9c4aa
    Merge pull request #486 from jmorganca/mxyng/fix-push Michael Yang 2023-09-07 13:58:34 -0700
  • a8da0bacbe update dockerignore Michael Yang 2023-09-07 13:36:25 -0700
  • bf146fb072 fix retry on unauthorized chunk Michael Yang 2023-09-07 12:01:50 -0700
  • f0f4943577 fix get auth token Michael Yang 2023-09-07 11:49:36 -0700
  • 09dd2aeff9
    GGUF support (#441) Bruce MacDonald 2023-09-07 13:55:37 -0400
  • 07b4074e7b
    [docs] Improve build instructions (#482) Alexander Pepper 2023-09-07 12:43:26 +0200
  • 61dda6a5e0 set minimum CMAKE_OSX_DEPLOYMENT_TARGET to 11.0 Jeffrey Morgan 2023-09-06 19:56:50 -0400
  • e1f9ced568
    Merge pull request #479 from jmorganca/mxyng/dockerfile Michael Yang 2023-09-06 15:44:24 -0700
  • 9795b43d93 update dockerfile Michael Yang 2023-09-06 14:14:20 -0700
  • 0980d5c7e3
    Merge pull request #478 from jmorganca/mxyng/cleanup Michael Yang 2023-09-06 15:18:54 -0700
  • 0dae34b6a7 remove unused openssh key types Michael Yang 2023-09-06 14:33:55 -0700
  • 83c6be1666
    fix model manifests (#477) Michael Yang 2023-09-06 14:30:08 -0700
  • 1adfa67589
    tighten up the error string for ollama show flags (#476) Patrick Devine 2023-09-06 13:38:49 -0700
  • 790d24eb7b
    add show command (#474) Patrick Devine 2023-09-06 11:04:17 -0700
  • 7de300856b use osPath in gpu check Jeffrey Morgan 2023-09-05 21:52:21 -0400
  • 213ffdb548 macos amd64 compatibility fixes Jeffrey Morgan 2023-09-05 21:33:27 -0400
  • d42d88386a
    Merge pull request #473 from jmorganca/mxyng/fix-manifest-path Michael Yang 2023-09-05 17:37:41 -0700
  • 154f24af91
    Added missing options params to the embeddings docs (#472) Ackermann Yuriy 2023-09-06 12:18:49 +1200
  • a1ecdd36d5 create manifests directory Michael Yang 2023-09-05 17:10:40 -0700
  • d18282bfda
    metal: add missing barriers for mul-mat (#469) Bruce MacDonald 2023-09-05 19:37:13 -0400
  • 9ae76ba8c9
    Merge pull request #471 from jmorganca/mxyng/fix-empty-response Michael Yang 2023-09-05 15:23:05 -0700
  • 2bc06565c7 fix empty response Michael Yang 2023-09-05 15:03:24 -0700
  • d1c2558f7e
    Merge pull request #461 from jmorganca/mxyng/fix-inherit-params Michael Yang 2023-09-05 12:30:23 -0700
  • 7b5aefb427
    Merge pull request #462 from jmorganca/mxyng/rm-marshal-prompt Michael Yang 2023-09-05 11:48:41 -0700
  • 06ef90c051 fix parameter inheritence Michael Yang 2023-09-05 11:05:03 -0700
  • 7efbc84320
    Merge pull request #464 from jmorganca/mxyng/fix-num-keep Michael Yang 2023-09-05 11:30:45 -0700
  • e9f6df7dca use slices.DeleteFunc Michael Yang 2023-09-02 14:38:51 -0400
  • 7fa6e51686
    generate binary dependencies based on GOARCH on macos (#459) Jeffrey Morgan 2023-09-05 12:53:57 -0400
  • 8dc68417e7
    Merge pull request #463 from jmorganca/mxyng/fix-last-token Michael Yang 2023-09-05 09:01:32 -0700
  • 681f3c4c42 fix num_keep Michael Yang 2023-09-03 17:36:14 -0400
  • 59a705525c fix not forwarding last token Michael Yang 2023-09-03 17:46:35 -0400
  • 5d3f314b0b remove marshalPrompt which is no longer needed Michael Yang 2023-09-03 14:10:03 -0400
  • adaa13088b
    Merge pull request #457 from sqs/dont-html-escape-prompt Michael Yang 2023-09-01 17:41:53 -0700
  • 62d29b2157 do not HTML-escape prompt Quinn Slack 2023-09-01 17:12:20 -0500
  • ed19d10aa5
    update readme (#451) Michael Yang 2023-09-01 13:44:14 -0700
  • 36c2f45c40
    Merge pull request #450 from jmorganca/mxyng/update-readme Michael Yang 2023-09-01 08:21:49 -0700
  • 742226625f update readme Michael Yang 2023-09-01 10:54:31 -0400
  • 6bb8a16ccb
    Merge pull request #273 from jmorganca/matt/moreexamples Matt Williams 2023-08-31 16:31:59 -0700
  • a5dbcf2e73 app: dont package ggml-metal.metal Jeffrey Morgan 2023-08-31 17:41:09 -0400
  • 9304f0e7a8
    Merge pull request #443 from jmorganca/mxyng/fix-list-models Michael Yang 2023-08-31 14:19:10 -0700
  • 6578b2f8a1
    Merge pull request #448 from callmephilip/patch-1 Michael Yang 2023-08-31 08:57:07 -0700
  • 1c8fd627ad windows: fix create modelfile Michael Yang 2023-08-30 16:01:23 -0400
  • ae950b00f1 windows: fix delete Michael Yang 2023-08-30 14:31:12 -0400
  • eeb40a672c fix list models for windows Michael Yang 2023-08-30 14:14:12 -0400
  • 0f541a0367 s/ListResponseModel/ModelResponse/ Michael Yang 2023-08-30 11:10:27 -0400
  • 1363f537ce
    fix spelling errors in prompt Philip Nuzhnyi 2023-08-31 10:02:46 +0100
  • bc3e21fdc6 update README.md Jeffrey Morgan 2023-08-30 17:56:14 -0400
  • a82eb275ff update docs for subprocess Jeffrey Morgan 2023-08-30 17:54:02 -0400
  • f964aea9a2 remove test not applicate to subprocess Bruce MacDonald 2023-08-30 16:36:11 -0400
  • 42998d797d
    subprocess llama.cpp server (#401) Bruce MacDonald 2023-08-30 16:35:03 -0400
  • f4432e1dba
    treat stop as stop sequences, not exact tokens (#442) Quinn Slack 2023-08-30 10:53:42 -0500
  • 982c535428
    Merge pull request #428 from jmorganca/mxyng/upload-chunks Michael Yang 2023-08-30 07:47:17 -0700
  • 7df342a6ea
    Merge pull request #421 from jmorganca/mxyng/f16-metal Michael Yang 2023-08-29 06:32:59 -0700
  • 8bbff2df98
    add model IDs (#439) Patrick Devine 2023-08-28 20:50:24 -0700
  • 16b06699fd remove unused parameter Michael Yang 2023-08-28 18:35:18 -0400
  • 246dc65417 loosen http status code checks Michael Yang 2023-08-26 21:55:21 -0700
  • 865fceb73c chunked pipe Michael Yang 2023-08-26 08:28:35 -0700
  • 72266c7684 bump chunk size to 95MB Michael Yang 2023-08-25 15:38:39 -0700
  • d3b838ce60 update orca to orca-mini Jeffrey Morgan 2023-08-27 13:26:30 -0400
  • e639a12fa1
    Merge pull request #412 from jmorganca/mxyng/update-readme Michael Yang 2023-08-26 21:26:34 -0700
  • e82fcf30c6
    Merge pull request #420 from jmorganca/mxyng/34b-mem-check Michael Yang 2023-08-26 14:15:52 -0700
  • 495e8b0a6a
    Merge pull request #426 from jmorganca/default-template Michael Yang 2023-08-26 14:15:38 -0700
  • 59734ca24d set default template Michael Yang 2023-08-26 12:20:28 -0700
  • 22ab7f5f88 default host to 127.0.0.1, fixes #424 Jeffrey Morgan 2023-08-26 11:59:28 -0700
  • b25dd1795d allow F16 to use metal Michael Yang 2023-08-26 08:33:03 -0700
  • 304f2b6c96 add 34b to mem check Michael Yang 2023-08-26 08:29:21 -0700
  • 2ecc3a33c3
    delete all models (not just 1st) in ollama rm (#415) Quinn Slack 2023-08-26 00:47:56 -0700
  • ee6e1df118 add codellama to model list in readme Jeffrey Morgan 2023-08-25 20:44:26 -0700
  • 177b69a211 add missing entries for 34B Jeffrey Morgan 2023-08-25 18:35:35 -0700
  • dad63f0821
    Merge pull request #411 from jmorganca/mxyng/34b Michael Yang 2023-08-25 11:59:05 -0700