Michael Yang
|
7eda70f23b
|
copy metadata from source
|
2023-08-17 21:55:25 -07:00 |
|
Michael Yang
|
086449b6c7
|
fmt
|
2023-08-17 15:32:31 -07:00 |
|
Michael Yang
|
3cbc6a5c01
|
fix push manifest
|
2023-08-17 15:28:12 -07:00 |
|
Michael Yang
|
a894cc792d
|
model and file type as strings
|
2023-08-17 12:08:04 -07:00 |
|
Michael Yang
|
b963a83559
|
Merge pull request #364 from jmorganca/chunked-uploads
reimplement chunked uploads
|
2023-08-17 09:58:51 -07:00 |
|
Michael Yang
|
bf6688abe6
|
Merge pull request #360 from jmorganca/fix-request-copies
Fix request copies
|
2023-08-17 09:58:42 -07:00 |
|
Bruce MacDonald
|
6005b157c2
|
retry download on network errors
|
2023-08-17 10:31:45 -04:00 |
|
Patrick Devine
|
14220d9833
|
set the scopes correctly (#368)
|
2023-08-16 21:42:02 -07:00 |
|
Michael Yang
|
5dfe91be8b
|
reimplement chunked uploads
|
2023-08-16 14:50:24 -07:00 |
|
Michael Yang
|
9f944c00f1
|
push: retry on unauthorized
|
2023-08-16 11:35:33 -07:00 |
|
Michael Yang
|
56e87cecb1
|
images: remove body copies
|
2023-08-16 10:30:41 -07:00 |
|
Michael Yang
|
5d9a4cd251
|
Merge pull request #348 from jmorganca/cross-repo-mount
cross repo blob mount
|
2023-08-16 09:20:36 -07:00 |
|
Bruce MacDonald
|
1deb35ca64
|
use loaded llm for generating model file embeddings
|
2023-08-15 16:12:02 -03:00 |
|
Bruce MacDonald
|
e2de886831
|
do not regenerate embeddings
|
2023-08-15 16:10:22 -03:00 |
|
Bruce MacDonald
|
f0d7c2f5ea
|
retry download on network errors
|
2023-08-15 15:07:19 -03:00 |
|
Bruce MacDonald
|
12052a7624
|
always remove from in progress map on download
|
2023-08-15 13:20:32 -03:00 |
|
Bruce MacDonald
|
326de48930
|
use loaded llm for embeddings
|
2023-08-15 10:50:54 -03:00 |
|
Bruce MacDonald
|
18f2cb0472
|
dont log fatal
|
2023-08-15 10:39:59 -03:00 |
|
Michael Yang
|
e26085b921
|
close open files
|
2023-08-14 16:08:06 -07:00 |
|
Michael Yang
|
f594c8eb91
|
cross repo mount
|
2023-08-14 15:07:35 -07:00 |
|
Bruce MacDonald
|
f020e1d519
|
always remove from in progress map on download
|
2023-08-14 13:09:20 -03:00 |
|
Bruce MacDonald
|
2c8b680b03
|
use file info for embeddings cache
|
2023-08-14 12:11:04 -03:00 |
|
Bruce MacDonald
|
99b6b60085
|
use model bin digest for embed digest
|
2023-08-14 11:57:12 -03:00 |
|
Bruce MacDonald
|
e9a9580bdd
|
do not regenerate embeddings
- re-use previously evaluated embeddings when possible
- change embeddings digest identifier to be based on model name and embedded file path
|
2023-08-14 10:34:17 -03:00 |
|
Patrick Devine
|
d9cf18e28d
|
add maximum retries when pushing (#334)
|
2023-08-11 15:41:55 -07:00 |
|
Jeffrey Morgan
|
1556162c90
|
create .ollama directory if it doesnt exist
|
2023-08-11 15:35:55 -07:00 |
|
Jeffrey Morgan
|
148f0225c0
|
create .ollama directory if it doesnt exist
|
2023-08-11 15:33:11 -07:00 |
|
Michael Yang
|
6517bcc53c
|
Merge pull request #290 from jmorganca/add-adapter-layers
implement loading ggml lora adapters through the modelfile
|
2023-08-10 17:23:01 -07:00 |
|
Michael Yang
|
6a6828bddf
|
Merge pull request #167 from jmorganca/decode-ggml
partial decode ggml bin for more info
|
2023-08-10 17:22:40 -07:00 |
|
Patrick Devine
|
be989d89d1
|
Token auth (#314)
|
2023-08-10 11:34:25 -07:00 |
|
Jeffrey Morgan
|
040a5b9750
|
clean up cli flags
|
2023-08-10 09:27:03 -07:00 |
|
Michael Yang
|
6de5d032e1
|
implement loading ggml lora adapters through the modelfile
|
2023-08-10 09:23:39 -07:00 |
|
Michael Yang
|
fccf8d179f
|
partial decode ggml bin for more info
|
2023-08-10 09:23:10 -07:00 |
|
Bruce MacDonald
|
4b3507f036
|
embeddings endpoint
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
|
2023-08-10 11:45:57 -04:00 |
|
Bruce MacDonald
|
984c9c628c
|
fix embeddings invalid values
|
2023-08-09 16:50:53 -04:00 |
|
Bruce MacDonald
|
ac971c56d1
|
Update images.go
|
2023-08-09 11:31:54 -04:00 |
|
Bruce MacDonald
|
8228d166ce
|
pr comments
|
2023-08-09 11:31:54 -04:00 |
|
Bruce MacDonald
|
907e6c56b3
|
unlock downloadu in case or requestDownload err
|
2023-08-09 11:31:54 -04:00 |
|
Bruce MacDonald
|
868e3b31c7
|
allow for concurrent pulls of the same files
|
2023-08-09 11:31:54 -04:00 |
|
Bruce MacDonald
|
09d8bf6730
|
fix build errors
|
2023-08-09 10:45:57 -04:00 |
|
Bruce MacDonald
|
7a5f3616fd
|
embed text document in modelfile
|
2023-08-09 10:26:19 -04:00 |
|
Jeffrey Morgan
|
cff002b824
|
use content type application/x-ndjson for streaming responses
|
2023-08-08 21:38:10 -07:00 |
|
Bruce MacDonald
|
1bee2347be
|
pr feedback
- defer closing llm on embedding
- do not override licenses
- remove debugging print line
- reformat model file docs
|
2023-08-08 17:01:37 -04:00 |
|
Jeffrey Morgan
|
a027a7dd65
|
add 0.0.0.0 as an allowed origin by default
Fixes #282
|
2023-08-08 13:39:50 -07:00 |
|
Bruce MacDonald
|
884d78ceb3
|
allow embedding from model binary
|
2023-08-08 14:38:57 -04:00 |
|
Bruce MacDonald
|
21ddcaa1f1
|
pr comments
- default to embeddings enabled
- move embedding logic for loaded model to request
- allow embedding full directory
- close llm on reload
|
2023-08-08 13:49:37 -04:00 |
|
Michael Yang
|
f2074ed4c0
|
Merge pull request #306 from jmorganca/default-keep-system
automatically set num_keep if num_keep < 0
|
2023-08-08 09:25:34 -07:00 |
|
Bruce MacDonald
|
a6f6d18f83
|
embed text document in modelfile
|
2023-08-08 11:27:17 -04:00 |
|
Bruce MacDonald
|
34a13a9d05
|
pass flags to serve to allow setting allowed-origins + host and port
|
2023-08-08 10:41:42 -04:00 |
|
Jeffrey Morgan
|
8713ac23a8
|
allow overriding template and system in /api/generate
Fixes #297
Fixes #296
|
2023-08-08 00:55:34 -04:00 |
|