Jeffrey Morgan
|
287ba11500
|
better error message when calling /api/generate or /api/chat with embedding models
|
2024-02-20 21:53:45 -05:00 |
|
Jeffrey Morgan
|
63861f58cc
|
Support for bert and nomic-bert embedding models
|
2024-02-20 21:37:29 -05:00 |
|
Michael Yang
|
210b65268e
|
replace strings buffer with hasher (#2437)
the buffered value is going into the hasher eventually so write directly
to the hasher instead
|
2024-02-20 19:07:50 -05:00 |
|
Michael Yang
|
897b213468
|
use http.DefaultClient (#2530)
default client already handles proxy
|
2024-02-20 18:34:47 -05:00 |
|
Bruce MacDonald
|
88622847c6
|
fix: chat system prompting overrides (#2542)
|
2024-02-16 14:42:43 -05:00 |
|
Michael Yang
|
e43648afe5
|
rerefactor
|
2024-02-15 05:56:45 +00:00 |
|
Daniel Hiltgen
|
f397e0e988
|
Move hub auth out to new package
|
2024-02-15 05:56:45 +00:00 |
|
Jeffrey Morgan
|
48a273f80b
|
Fix issues with templating prompt in chat mode (#2460)
|
2024-02-12 15:06:57 -08:00 |
|
Jeffrey Morgan
|
1f9078d6ae
|
Check image filetype in api handlers (#2467)
|
2024-02-12 11:16:20 -08:00 |
|
Jeffrey Morgan
|
a0a199b108
|
Fix hanging issue when sending empty content (#2399)
|
2024-02-07 19:30:33 -05:00 |
|
Jeffrey Morgan
|
453f572f83
|
Initial OpenAI /v1/chat/completions API compatibility (#2376)
|
2024-02-07 17:24:29 -05:00 |
|
Michael Yang
|
e805ac1d59
|
fix response on token error
|
2024-02-07 11:05:49 -08:00 |
|
Michael Yang
|
bfbf2f7cf7
|
Merge pull request #2296 from ollama/mxyng/img-tags
append image tags to user content
|
2024-02-01 13:16:59 -08:00 |
|
Michael Yang
|
3d6f48507a
|
structured debug prompt
|
2024-02-01 11:56:28 -08:00 |
|
Michael Yang
|
f3761405c8
|
use image id
|
2024-02-01 11:52:42 -08:00 |
|
Michael Yang
|
e49dc9f3d8
|
fix tests
|
2024-02-01 11:48:11 -08:00 |
|
Michael Yang
|
d125510b4b
|
remove image tags
|
2024-02-01 11:32:51 -08:00 |
|
Michael Yang
|
fb56988014
|
account for image projection in token count
|
2024-02-01 09:50:48 -08:00 |
|
Michael Yang
|
d046bee790
|
use llm.ImageData for chat
|
2024-01-31 19:18:25 -08:00 |
|
Jeffrey Morgan
|
f11bf0740b
|
use llm.ImageData
|
2024-01-31 19:13:48 -08:00 |
|
Michael Yang
|
8450bf66e6
|
trim images
|
2024-01-31 19:13:47 -08:00 |
|
Michael Yang
|
b4e11be8ef
|
append image tags to user content
|
2024-01-31 19:13:10 -08:00 |
|
Bruce MacDonald
|
a896079705
|
preserve last system message from modelfile (#2289)
|
2024-01-31 21:45:01 -05:00 |
|
Michael Yang
|
8ac08a0eec
|
update slog handler options
- consistent format by using text handler for debug and non-debug
- truncate source file to just the file name
|
2024-01-31 15:15:00 -08:00 |
|
Michael Yang
|
c8b1f2369e
|
remove unnecessary parse raw
|
2024-01-30 17:00:53 -08:00 |
|
Bruce MacDonald
|
0632dff3f8
|
trim chat prompt based on llm context size (#1963)
|
2024-01-30 15:59:29 -05:00 |
|
Jeffrey Morgan
|
f2245c7c77
|
print prompt with OLLAMA_DEBUG=1 (#2245)
|
2024-01-28 15:22:35 -08:00 |
|
Jeffrey Morgan
|
e4b9b72f2a
|
Do not repeat system prompt for chat templating (#2241)
|
2024-01-28 14:15:56 -08:00 |
|
Patrick Devine
|
b5cf31b460
|
add keep_alive to generate/chat/embedding api endpoints (#2146)
|
2024-01-26 14:28:02 -08:00 |
|
Michael Yang
|
9d3dcfd0ec
|
fix logging
|
2024-01-26 11:04:27 -08:00 |
|
Michael Yang
|
6e0ea5ecc8
|
Merge pull request #1916 from ollama/mxyng/inactivity-monitor
download: add inactivity monitor
|
2024-01-26 10:56:00 -08:00 |
|
Patrick Devine
|
7c40a67841
|
Save and load sessions (#2063)
|
2024-01-25 12:12:36 -08:00 |
|
Michael Yang
|
c08dfaa23d
|
fix: remove overwritten model layers
if create overrides a manifest, first add the older manifest's layers to
the delete map so they can be cleaned up
|
2024-01-19 14:58:37 -08:00 |
|
Michael Yang
|
aac9ab4db7
|
fix show handler
|
2024-01-18 15:36:50 -08:00 |
|
Michael Yang
|
745b5934fa
|
add model to ModelResponse
|
2024-01-18 14:32:55 -08:00 |
|
Michael Yang
|
a38d88d828
|
api: add model for all requests
prefer using req.Model and fallback to req.Name
|
2024-01-18 14:31:37 -08:00 |
|
Daniel Hiltgen
|
fedd705aea
|
Mechanical switch from log to slog
A few obvious levels were adjusted, but generally everything mapped to "info" level.
|
2024-01-18 14:12:57 -08:00 |
|
Michael Yang
|
96cfb62641
|
fix: normalize name path before splitting
|
2024-01-16 16:48:29 -08:00 |
|
Patrick Devine
|
eef50accb4
|
Fix show parameters (#2017)
|
2024-01-16 10:34:44 -08:00 |
|
Michael Yang
|
27331ae3a8
|
download: add inactivity monitor
if a download part is inactive for some time, restart it
|
2024-01-12 15:23:15 -08:00 |
|
Michael Yang
|
cf29bd2d72
|
fix: request retry with error
this fixes a subtle bug with makeRequestWithRetry where an HTTP status
error on a retried request will potentially not return the right err
|
2024-01-12 13:32:27 -08:00 |
|
Michael Yang
|
2b9892a808
|
fix(windows): modelpath and list
|
2024-01-09 09:36:58 -08:00 |
|
Michael Yang
|
2bb2bdd5d4
|
fix lint
|
2024-01-09 09:36:58 -08:00 |
|
Michael Yang
|
acfc376efd
|
add .golangci.yaml
|
2024-01-09 09:36:58 -08:00 |
|
Bruce MacDonald
|
7e8f7c8358
|
remove ggml automatic re-pull (#1856)
|
2024-01-08 14:41:01 -05:00 |
|
Michael Yang
|
0101e76dbe
|
Merge pull request #1797 from sublimator/nd-allow-extension-origins-still-needs-explicit-listing-2024-01-05
fix: allow extension origins (still needs explicit listing), fixes #1686
|
2024-01-05 17:20:09 -08:00 |
|
Patrick Devine
|
22e93efa41
|
add show info command and fix the modelfile
|
2024-01-05 12:20:05 -08:00 |
|
Nicholas Dudfield
|
8baaaa39c0
|
Allow extension origins (still needs explicit listing), fixes #1686
|
2024-01-05 09:06:47 +07:00 |
|
Bruce MacDonald
|
4ad6c9b11f
|
fix: pull either original model or from model on create (#1774)
|
2024-01-04 01:34:38 -05:00 |
|
Bruce MacDonald
|
0b3118e0af
|
fix: relay request opts to loaded llm prediction (#1761)
|
2024-01-03 12:01:42 -05:00 |
|