Michael Yang
bc9573dcb1
Merge pull request #530 from jmorganca/mxyng/progresswriter
...
implement ProgressWriter
2023-09-15 12:43:46 -07:00
Michael Yang
e53bc57d4d
split uploadBlobChunked
2023-09-14 17:22:05 -07:00
Michael Yang
f0b398d17f
implement ProgressWriter
2023-09-14 17:22:04 -07:00
Patrick Devine
8efbc5df55
DRAFT: add a simple python client to access ollama ( #522 )
2023-09-14 16:37:38 -07:00
Michael Yang
ccc3e9ac6d
Merge pull request #531 from jmorganca/mxyng/content-length
...
set request.ContentLength
2023-09-14 13:33:11 -07:00
Michael Yang
daa4f096f9
set request.ContentLength
...
This informs the HTTP client the content length is known and disables
chunked Transfer-Encoding
2023-09-14 13:32:44 -07:00
Michael Yang
3ee85f1c6c
Merge pull request #526 from jmorganca/mxyng/cleanup
...
remove unused
2023-09-14 13:10:59 -07:00
Bruce MacDonald
2540c9181c
support for packaging in multiple cuda runners ( #509 )
...
* enable packaging multiple cuda versions
* use nvcc cuda version if available
---------
Co-authored-by: Michael Yang <mxyng@pm.me>
2023-09-14 15:08:13 -04:00
Michael Yang
83ffb154bc
Merge pull request #507 from jmorganca/mxyng/build
...
update docker image
2023-09-14 11:25:59 -07:00
Michael Yang
9aa192c812
update cuda docker image
2023-09-14 11:25:20 -07:00
Matt Williams
fc8707686f
Update API docs ( #527 )
...
* Update API docs
Signed-off-by: Matt Williams <m@technovangelist.com>
* strange TOC was getting auto generated
Signed-off-by: Matt Williams <m@technovangelist.com>
* Update docs/api.md
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
* Update docs/api.md
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
* Update docs/api.md
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
* Update api.md
---------
Signed-off-by: Matt Williams <m@technovangelist.com>
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
Co-authored-by: Michael Chiang <mchiang0610@users.noreply.github.com>
2023-09-14 08:51:26 -07:00
Michael Yang
f89c23764b
Merge pull request #525 from jmorganca/mxyng/falcon-decode
...
fix: add falcon.go
2023-09-13 15:08:47 -07:00
Michael Yang
e6881cabd0
remove unused
2023-09-13 14:48:33 -07:00
Michael Yang
d028853879
fix: add falcon.go
2023-09-13 14:47:37 -07:00
Michael Yang
949553db23
Merge pull request #519 from jmorganca/mxyng/decode
...
Mxyng/decode
2023-09-13 12:43:57 -07:00
Michael Yang
0c5a454361
fix model type for 70b
2023-09-12 15:12:59 -07:00
Bruce MacDonald
f59c4d03f7
fix ggml arm64 cuda build ( #520 )
2023-09-12 17:06:48 -04:00
Michael Yang
7dee25a07f
fix falcon decode
...
get model and file type from bin file
2023-09-12 12:34:53 -07:00
Bruce MacDonald
f221637053
first pass at linux gpu support ( #454 )
...
* linux gpu support
* handle multiple gpus
* add cuda docker image (#488 )
---------
Co-authored-by: Michael Yang <mxyng@pm.me>
2023-09-12 11:04:35 -04:00
Patrick Devine
45ac07cd02
create the blobs directory correctly ( #508 )
2023-09-11 14:54:52 -07:00
Jeffrey Morgan
7d749cc787
fix darwin build script
2023-09-11 16:31:46 -04:00
Patrick Devine
e7e91cd71c
add autoprune to remove unused layers ( #491 )
2023-09-11 11:46:35 -07:00
Jeffrey Morgan
3920e15386
add model format to config layer ( #497 )
2023-09-09 17:53:44 -04:00
Michael Yang
41e976edde
Merge pull request #492 from jmorganca/mxyng/nil-pointer
...
fix nil pointer dereference
2023-09-07 17:25:23 -07:00
Michael Yang
de227b620f
fix nil pointer dereference
2023-09-07 17:24:31 -07:00
Michael Yang
63def6ca49
Merge pull request #487 from jmorganca/mxyng/dockerignore
...
update dockerignore
2023-09-07 14:16:17 -07:00
Michael Yang
738fe9c4aa
Merge pull request #486 from jmorganca/mxyng/fix-push
...
fix: retry push on expired token
2023-09-07 13:58:34 -07:00
Michael Yang
a8da0bacbe
update dockerignore
2023-09-07 13:36:25 -07:00
Michael Yang
bf146fb072
fix retry on unauthorized chunk
2023-09-07 12:02:04 -07:00
Michael Yang
f0f4943577
fix get auth token
2023-09-07 12:01:56 -07:00
Bruce MacDonald
09dd2aeff9
GGUF support ( #441 )
2023-09-07 13:55:37 -04:00
Alexander Pepper
07b4074e7b
[docs] Improve build instructions ( #482 )
...
Go is required and not installed by default.
2023-09-07 06:43:26 -04:00
Jeffrey Morgan
61dda6a5e0
set minimum CMAKE_OSX_DEPLOYMENT_TARGET
to 11.0
2023-09-06 19:56:50 -04:00
Michael Yang
e1f9ced568
Merge pull request #479 from jmorganca/mxyng/dockerfile
...
update dockerfile
2023-09-06 15:44:24 -07:00
Michael Yang
9795b43d93
update dockerfile
2023-09-06 15:31:25 -07:00
Michael Yang
0980d5c7e3
Merge pull request #478 from jmorganca/mxyng/cleanup
...
remove unused openssh key types
2023-09-06 15:18:54 -07:00
Michael Yang
0dae34b6a7
remove unused openssh key types
2023-09-06 14:34:09 -07:00
Michael Yang
83c6be1666
fix model manifests ( #477 )
2023-09-06 17:30:08 -04:00
Patrick Devine
1adfa67589
tighten up the error string for ollama show
flags ( #476 )
2023-09-06 13:38:49 -07:00
Patrick Devine
790d24eb7b
add show command ( #474 )
2023-09-06 11:04:17 -07:00
Jeffrey Morgan
7de300856b
use osPath
in gpu check
2023-09-05 21:52:21 -04:00
Jeffrey Morgan
213ffdb548
macos amd64
compatibility fixes
2023-09-05 21:33:31 -04:00
Michael Yang
d42d88386a
Merge pull request #473 from jmorganca/mxyng/fix-manifest-path
...
create manifests directory
2023-09-05 17:37:41 -07:00
Ackermann Yuriy
154f24af91
Added missing options params to the embeddings docs ( #472 )
2023-09-05 20:18:49 -04:00
Michael Yang
a1ecdd36d5
create manifests directory
2023-09-05 17:10:40 -07:00
Bruce MacDonald
d18282bfda
metal: add missing barriers for mul-mat ( #469 )
2023-09-05 19:37:13 -04:00
Michael Yang
9ae76ba8c9
Merge pull request #471 from jmorganca/mxyng/fix-empty-response
...
fix empty response
2023-09-05 15:23:05 -07:00
Michael Yang
2bc06565c7
fix empty response
2023-09-05 15:03:24 -07:00
Michael Yang
d1c2558f7e
Merge pull request #461 from jmorganca/mxyng/fix-inherit-params
...
fix inherit params
2023-09-05 12:30:23 -07:00
Michael Yang
7b5aefb427
Merge pull request #462 from jmorganca/mxyng/rm-marshal-prompt
...
remove marshalPrompt which is no longer needed
2023-09-05 11:48:41 -07:00