Michael Yang
e53bc57d4d
split uploadBlobChunked
2023-09-14 17:22:05 -07:00
Michael Yang
f0b398d17f
implement ProgressWriter
2023-09-14 17:22:04 -07:00
Patrick Devine
8efbc5df55
DRAFT: add a simple python client to access ollama ( #522 )
2023-09-14 16:37:38 -07:00
Michael Yang
ccc3e9ac6d
Merge pull request #531 from jmorganca/mxyng/content-length
...
set request.ContentLength
2023-09-14 13:33:11 -07:00
Michael Yang
daa4f096f9
set request.ContentLength
...
This informs the HTTP client the content length is known and disables
chunked Transfer-Encoding
2023-09-14 13:32:44 -07:00
Michael Yang
3ee85f1c6c
Merge pull request #526 from jmorganca/mxyng/cleanup
...
remove unused
2023-09-14 13:10:59 -07:00
Bruce MacDonald
2540c9181c
support for packaging in multiple cuda runners ( #509 )
...
* enable packaging multiple cuda versions
* use nvcc cuda version if available
---------
Co-authored-by: Michael Yang <mxyng@pm.me>
2023-09-14 15:08:13 -04:00
Michael Yang
83ffb154bc
Merge pull request #507 from jmorganca/mxyng/build
...
update docker image
2023-09-14 11:25:59 -07:00
Michael Yang
9aa192c812
update cuda docker image
2023-09-14 11:25:20 -07:00
Matt Williams
fc8707686f
Update API docs ( #527 )
...
* Update API docs
Signed-off-by: Matt Williams <m@technovangelist.com>
* strange TOC was getting auto generated
Signed-off-by: Matt Williams <m@technovangelist.com>
* Update docs/api.md
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
* Update docs/api.md
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
* Update docs/api.md
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
* Update api.md
---------
Signed-off-by: Matt Williams <m@technovangelist.com>
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
Co-authored-by: Michael Chiang <mchiang0610@users.noreply.github.com>
2023-09-14 08:51:26 -07:00
Michael Yang
f89c23764b
Merge pull request #525 from jmorganca/mxyng/falcon-decode
...
fix: add falcon.go
2023-09-13 15:08:47 -07:00
Michael Yang
e6881cabd0
remove unused
2023-09-13 14:48:33 -07:00
Michael Yang
d028853879
fix: add falcon.go
2023-09-13 14:47:37 -07:00
Michael Yang
949553db23
Merge pull request #519 from jmorganca/mxyng/decode
...
Mxyng/decode
2023-09-13 12:43:57 -07:00
Michael Yang
0c5a454361
fix model type for 70b
2023-09-12 15:12:59 -07:00
Bruce MacDonald
f59c4d03f7
fix ggml arm64 cuda build ( #520 )
2023-09-12 17:06:48 -04:00
Michael Yang
7dee25a07f
fix falcon decode
...
get model and file type from bin file
2023-09-12 12:34:53 -07:00
Bruce MacDonald
f221637053
first pass at linux gpu support ( #454 )
...
* linux gpu support
* handle multiple gpus
* add cuda docker image (#488 )
---------
Co-authored-by: Michael Yang <mxyng@pm.me>
2023-09-12 11:04:35 -04:00
Patrick Devine
45ac07cd02
create the blobs directory correctly ( #508 )
2023-09-11 14:54:52 -07:00
Jeffrey Morgan
7d749cc787
fix darwin build script
2023-09-11 16:31:46 -04:00
Patrick Devine
e7e91cd71c
add autoprune to remove unused layers ( #491 )
2023-09-11 11:46:35 -07:00
Jeffrey Morgan
3920e15386
add model format to config layer ( #497 )
2023-09-09 17:53:44 -04:00
Michael Yang
41e976edde
Merge pull request #492 from jmorganca/mxyng/nil-pointer
...
fix nil pointer dereference
2023-09-07 17:25:23 -07:00
Michael Yang
de227b620f
fix nil pointer dereference
2023-09-07 17:24:31 -07:00
Michael Yang
63def6ca49
Merge pull request #487 from jmorganca/mxyng/dockerignore
...
update dockerignore
2023-09-07 14:16:17 -07:00
Michael Yang
738fe9c4aa
Merge pull request #486 from jmorganca/mxyng/fix-push
...
fix: retry push on expired token
2023-09-07 13:58:34 -07:00
Michael Yang
a8da0bacbe
update dockerignore
2023-09-07 13:36:25 -07:00
Michael Yang
bf146fb072
fix retry on unauthorized chunk
2023-09-07 12:02:04 -07:00
Michael Yang
f0f4943577
fix get auth token
2023-09-07 12:01:56 -07:00
Bruce MacDonald
09dd2aeff9
GGUF support ( #441 )
2023-09-07 13:55:37 -04:00
Alexander Pepper
07b4074e7b
[docs] Improve build instructions ( #482 )
...
Go is required and not installed by default.
2023-09-07 06:43:26 -04:00
Jeffrey Morgan
61dda6a5e0
set minimum CMAKE_OSX_DEPLOYMENT_TARGET
to 11.0
2023-09-06 19:56:50 -04:00
Michael Yang
e1f9ced568
Merge pull request #479 from jmorganca/mxyng/dockerfile
...
update dockerfile
2023-09-06 15:44:24 -07:00
Michael Yang
9795b43d93
update dockerfile
2023-09-06 15:31:25 -07:00
Michael Yang
0980d5c7e3
Merge pull request #478 from jmorganca/mxyng/cleanup
...
remove unused openssh key types
2023-09-06 15:18:54 -07:00
Michael Yang
0dae34b6a7
remove unused openssh key types
2023-09-06 14:34:09 -07:00
Michael Yang
83c6be1666
fix model manifests ( #477 )
2023-09-06 17:30:08 -04:00
Patrick Devine
1adfa67589
tighten up the error string for ollama show
flags ( #476 )
2023-09-06 13:38:49 -07:00
Patrick Devine
790d24eb7b
add show command ( #474 )
2023-09-06 11:04:17 -07:00
Jeffrey Morgan
7de300856b
use osPath
in gpu check
2023-09-05 21:52:21 -04:00
Jeffrey Morgan
213ffdb548
macos amd64
compatibility fixes
2023-09-05 21:33:31 -04:00
Michael Yang
d42d88386a
Merge pull request #473 from jmorganca/mxyng/fix-manifest-path
...
create manifests directory
2023-09-05 17:37:41 -07:00
Ackermann Yuriy
154f24af91
Added missing options params to the embeddings docs ( #472 )
2023-09-05 20:18:49 -04:00
Michael Yang
a1ecdd36d5
create manifests directory
2023-09-05 17:10:40 -07:00
Bruce MacDonald
d18282bfda
metal: add missing barriers for mul-mat ( #469 )
2023-09-05 19:37:13 -04:00
Michael Yang
9ae76ba8c9
Merge pull request #471 from jmorganca/mxyng/fix-empty-response
...
fix empty response
2023-09-05 15:23:05 -07:00
Michael Yang
2bc06565c7
fix empty response
2023-09-05 15:03:24 -07:00
Michael Yang
d1c2558f7e
Merge pull request #461 from jmorganca/mxyng/fix-inherit-params
...
fix inherit params
2023-09-05 12:30:23 -07:00
Michael Yang
7b5aefb427
Merge pull request #462 from jmorganca/mxyng/rm-marshal-prompt
...
remove marshalPrompt which is no longer needed
2023-09-05 11:48:41 -07:00
Michael Yang
06ef90c051
fix parameter inheritence
...
parameters are not inherited because they are processed differently from
other layer. fix this by explicitly merging the inherited params into
the new params. parameter values defined in the new modelfile will
override those defined in the inherited modelfile. array lists are
replaced instead of appended
2023-09-05 11:40:20 -07:00