Commit graph

  • 37096790a7
    Merge pull request #5552 from ollama/mxyng/messages-docs Michael Yang 2024-07-25 16:26:19 -0700
  • 997c903884
    Update docs/template.md Michael Yang 2024-07-25 16:23:40 -0700
  • c8af3c2d96
    server: reuse original download URL for images (#5962) Blake Mizerany 2024-07-25 15:58:30 -0700
  • 455e61170d
    Update openai.md Jeffrey Morgan 2024-07-25 18:34:47 -0400
  • 4de1370a9d
    openai tools doc (#5617) royjhan 2024-07-25 15:34:06 -0700
  • bbf8f102ee
    Revert "llm(llama): pass rope factors (#5924)" (#5963) Jeffrey Morgan 2024-07-25 18:24:55 -0400
  • ce3c93b08f Report better error on cuda unsupported os/arch Daniel Hiltgen 2024-07-24 17:09:20 -0700
  • 6c2129d5d0 Explain font problems on windows 10 Daniel Hiltgen 2024-07-24 15:22:00 -0700
  • 7c2a157ca4 Ensure amd gpu nodes are numerically sorted Daniel Hiltgen 2024-07-24 13:43:26 -0700
  • bb46bbcf5e
    llm(llama): pass rope factors (#5924) Michael Yang 2024-07-24 13:05:59 -0700
  • ac33aa7d37
    Fix Embed Test Flakes (#5893) royjhan 2024-07-24 11:15:46 -0700
  • 830fdd2715 Better explain multi-gpu behavior Daniel Hiltgen 2024-07-23 15:14:28 -0700
  • a6cd8f6169
    Update README.md to add LLMStack integration (#5799) Ajay Chintala 2024-07-23 11:40:23 -0700
  • c78089263a
    Merge pull request #5864 from dhiltgen/bump_go Daniel Hiltgen 2024-07-22 16:34:18 -0700
  • 3e5ea035d5
    Merge pull request #5757 from lreed-mdsol/lreed/bump-go-version-fix-vulnerabilities Daniel Hiltgen 2024-07-22 16:32:43 -0700
  • 5d604eec5b Bump Go patch version Daniel Hiltgen 2024-07-22 16:16:28 -0700
  • db0968f30c
    fix dupe err message (#5857) Josh 2024-07-22 15:48:15 -0700
  • e12fff8810 Enable windows error dialog for subprocess startup Daniel Hiltgen 2024-07-15 09:25:56 -0700
  • 9b60a038e5 update api.md Michael Yang 2024-07-22 13:34:56 -0700
  • 83a0cb8d88 docs Michael Yang 2024-07-02 14:52:18 -0700
  • c0648233f2
    api embed docs (#5282) royjhan 2024-07-22 13:37:08 -0700
  • d835368eb8
    convert: capture head_dim for mistral (#5818) Jeffrey Morgan 2024-07-22 16:16:22 -0400
  • 85d9d73a72 comments Michael Yang 2024-07-08 10:34:12 -0700
  • 78140a712c cleanup tests Michael Yang 2024-07-05 16:52:01 -0700
  • 1954ec5917 uint64 Michael Yang 2024-07-03 19:43:17 -0700
  • 0f1910129f int Michael Yang 2024-07-03 19:41:17 -0700
  • e2c3f6b3e2 string Michael Yang 2024-07-03 19:30:19 -0700
  • 8570c1c0ef keepalive Michael Yang 2024-07-03 18:39:35 -0700
  • 55cd3ddcca bool Michael Yang 2024-07-03 17:22:13 -0700
  • 66fe77f084 models Michael Yang 2024-07-03 17:07:42 -0700
  • d1a5227cad origins Michael Yang 2024-07-03 17:02:07 -0700
  • 4f1afd575d host Michael Yang 2024-07-03 16:44:57 -0700
  • 35b89b2eab rfc: dynamic environ lookup Michael Yang 2024-07-03 16:00:54 -0700
  • 5784c05397
    Merge pull request #5854 from dhiltgen/win_exit_status Daniel Hiltgen 2024-07-22 10:40:22 -0700
  • f14aa5435d
    Merge pull request #5855 from dhiltgen/remove_max_vram Daniel Hiltgen 2024-07-22 10:35:29 -0700
  • f8fedbda20
    Update llama.cpp submodule commit to d94c6e0c (#5805) Jeffrey Morgan 2024-07-22 12:42:00 -0400
  • b3e5491e41
    server: collect nested tool call objects when parsing (#5824) Jeffrey Morgan 2024-07-22 12:38:03 -0400
  • cc269ba094 Remove no longer supported max vram var Daniel Hiltgen 2024-07-22 09:08:11 -0700
  • a3c20e3f18 Refine error reporting for subprocess crash Daniel Hiltgen 2024-07-22 08:52:16 -0700
  • 1d125ce9b7
    Merge https://github.com/ollama/ollama baalajimaestro 2024-07-21 14:17:56 +0530
  • 80ee9b5e47
    Remove out of space test temporarily (#5825) Jeffrey Morgan 2024-07-21 00:22:11 -0400
  • 5534f2cc6a
    llm: consider head_dim in llama arch (#5817) Jeffrey Morgan 2024-07-20 21:48:12 -0400
  • d321297d8a
    Merge pull request #5815 from dhiltgen/win_rocm_gfx_features Daniel Hiltgen 2024-07-20 16:02:55 -0700
  • 06e5d74e34
    Merge pull request #5506 from dhiltgen/sched_tests Daniel Hiltgen 2024-07-20 15:48:39 -0700
  • 5d707e6fd5
    Merge pull request #5583 from dhiltgen/integration_improvements Daniel Hiltgen 2024-07-20 15:48:21 -0700
  • 283948c83b Adjust windows ROCm discovery Daniel Hiltgen 2024-07-19 15:07:26 -0700
  • 1475eab95f
    add patch for tekken (#5807) Jeffrey Morgan 2024-07-20 13:41:21 -0400
  • 20090f3172
    preserve last assistant message (#5802) Jeffrey Morgan 2024-07-19 20:19:26 -0700
  • 69a2d4ccff
    Fix generate test flakyness (#5804) Jeffrey Morgan 2024-07-19 19:11:25 -0700
  • e8b954c646
    server: validate template (#5734) Josh 2024-07-19 15:24:29 -0700
  • c57317cbf0
    OpenAI: Function Based Testing (#5752) royjhan 2024-07-19 11:37:12 -0700
  • 51b2fd299c
    adjust openai chat msg processing (#5729) royjhan 2024-07-19 11:19:20 -0700
  • d0634b1596
    Merge pull request #5780 from ollama/mxyng/tools Michael Yang 2024-07-18 12:14:10 -0700
  • 43606d6d6a fix parsing tool calls Michael Yang 2024-07-18 12:07:59 -0700
  • 70b1010fa5
    server: check for empty tools array too (#5779) Jeffrey Morgan 2024-07-18 11:44:57 -0700
  • 84e5721f3a
    always provide content even if empty (#5778) Jeffrey Morgan 2024-07-18 11:28:19 -0700
  • 319fb1ce03
    server: only parse tool calls if tools are provided (#5771) Jeffrey Morgan 2024-07-18 08:50:23 -0700
  • b255445557
    marshal json automatically for some template values (#5758) Michael Yang 2024-07-17 15:35:11 -0700
  • f02f83660c bump go version to 1.22.5 to fix security vulnerabilities lreed 2024-07-17 21:44:19 +0000
  • b23424bb3c
    Merge pull request #5753 from ollama/mxyng/parse-tool-call Michael Yang 2024-07-17 11:47:53 -0700
  • 5fd6988126 parse tool call as individual objects Michael Yang 2024-07-17 11:02:36 -0700
  • 5b82960df8
    stub response (#5750) Michael Yang 2024-07-17 10:39:22 -0700
  • cc9a252d8c
    Merge pull request #5732 from ollama/mxyng/cleanup Michael Yang 2024-07-17 10:26:54 -0700
  • d281a6e603
    add sidellama link (#5702) Pákozdi György 2024-07-17 19:24:44 +0200
  • 154f6f45d4
    OpenAI: Support Tools (#5614) royjhan 2024-07-16 20:52:59 -0700
  • 0d41623b52
    OpenAI: Add Suffix to v1/completions (#5611) royjhan 2024-07-16 20:50:14 -0700
  • c279f96371 remove ToolCall from GenerateResponse Michael Yang 2024-07-16 14:51:19 -0700
  • 499e87c9ba
    Merge pull request #5730 from ollama/mxyng/cleanup Michael Yang 2024-07-16 14:42:13 -0700
  • cd0853f2d5
    Merge pull request #5207 from ollama/mxyng/suffix Michael Yang 2024-07-16 14:37:32 -0700
  • d290e87513 add suffix support to generate endpoint Michael Yang 2024-06-20 19:13:36 -0700
  • 97c20ede33
    README: Added AI Studio to the list of UIs (#5721) Thorsten Sommer 2024-07-16 23:24:27 +0200
  • 5a83f79afd remove unneeded tool calls Michael Yang 2024-07-16 13:48:38 -0700
  • 987dbab0b0
    OpenAI: /v1/embeddings compatibility (#5285) royjhan 2024-07-16 13:36:08 -0700
  • a8388beb94
    Merge pull request #5726 from ollama/mxyng/tools-templates Michael Yang 2024-07-16 12:12:10 -0700
  • 5afbb60fc4 fix unmarshal type errors Michael Yang 2024-07-16 09:38:46 -0700
  • 4cb5d7decc
    server: omit model system prompt if empty (#5717) Jeffrey Morgan 2024-07-16 11:09:00 -0700
  • 87345eda1b
    Ditch the runner container entirely and use build environment as the runner environment baalajimaestro 2024-07-16 22:42:01 +0530
  • 8eac50dd4f
    Merge pull request #5684 from ollama/mxyng/tests Michael Yang 2024-07-16 09:44:45 -0700
  • 4a565cbf94 add chat and generate tests with mock runner Michael Yang 2024-07-13 17:46:24 -0700
  • 696e20eeae
    Merge https://github.com/ollama/ollama baalajimaestro 2024-07-16 21:50:57 +0530
  • 64039df6d7
    Merge pull request #5284 from ollama/mxyng/tools Michael Yang 2024-07-15 18:03:37 -0700
  • 7ac6d462ec
    server: return empty slice on empty /api/embed request (#5713) Jeffrey Morgan 2024-07-15 17:39:44 -0700
  • ef5136a745 tools test Michael Yang 2024-07-15 12:17:38 -0700
  • 8288ec8824
    Merge pull request #5710 from dhiltgen/rocm_bump Daniel Hiltgen 2024-07-15 15:32:18 -0700
  • d02bbebb11 tools Michael Yang 2024-06-20 13:45:47 -0700
  • 224337b32f Bump linux ROCm to 6.1.2 Daniel Hiltgen 2024-07-15 15:10:22 -0700
  • 9e35d9bbee
    server: lowercase roles for compatibility with clients (#5695) Jeffrey Morgan 2024-07-15 13:55:57 -0700
  • b9f5e16c80
    Introduce /api/embed endpoint supporting batch embedding (#5127) royjhan 2024-07-15 12:14:24 -0700
  • 8c6402d194
    Merge https://github.com/ollama/ollama baalajimaestro 2024-07-14 16:51:20 +0530
  • e9f7f36029
    Support image input for OpenAI chat compatibility (#5208) royjhan 2024-07-13 22:07:45 -0700
  • 057d31861e
    remove template (#5655) Patrick Devine 2024-07-13 20:56:24 -0700
  • f7ee012300 server: prepend system message in chat handler jmorganca 2024-07-13 15:08:00 -0700
  • 1ed0aa8fea
    server: fix context, load_duration and total_duration fields (#5676) Jeffrey Morgan 2024-07-13 09:25:31 -0700
  • ef98803d63
    llm: looser checks for minimum memory (#5677) Jeffrey Morgan 2024-07-13 09:20:05 -0700
  • 02fea420e5
    Add Kerlig AI, an app for macOS (#5675) Jarek 2024-07-13 17:33:46 +0200
  • 22c5451fc2
    fix system prompt (#5662) Michael Yang 2024-07-12 21:04:44 -0700
  • ebc529cbb3 autodetect stop parameters from template Michael Yang 2024-07-05 17:31:23 -0700
  • 23ebbaa46e Revert "remove template from tests" Patrick Devine 2024-07-12 15:47:17 -0700
  • 9ac0a7a50b remove template from tests Patrick Devine 2024-07-12 15:41:31 -0700
  • e5c65a85df
    Merge pull request #5653 from ollama/mxyng/collect-system Michael Yang 2024-07-12 12:32:34 -0700