Michael Yang
cb4a80b693
fix: regression unsupported metal types
...
omitting `--n-gpu-layers` means use metal on macos which isn't correct
since ollama uses `num_gpu=0` to explicitly disable gpu for file types
that are not implemented in metal
2023-10-16 14:37:20 -07:00
Bruce MacDonald
68d7255bd3
show request to server rather than local check (#778)
2023-10-16 17:27:25 -04:00
Michael Yang
9ef2fce33a
Merge pull request #768 from jmorganca/mxyng/bytes
...
fix memory check
2023-10-16 12:42:41 -07:00
Michael Yang
43eaba3d60
Merge pull request #787 from jmorganca/mxyng/server-version2
...
server: print version on start
2023-10-16 09:59:30 -07:00
Michael Yang
1af493c5a0
server: print version on start
2023-10-16 09:59:14 -07:00
Bruce MacDonald
a0c3e989de
deprecate modelfile embed command (#759)
2023-10-16 11:07:37 -04:00
Sergey Kostyaev
7af0fdce48
add ellama community integration
2023-10-16 16:39:10 +07:00
Arne Müller
ee94693b1a
handling unescaped json marshaling
2023-10-16 11:15:55 +02:00
Yiorgis Gozadinos
731dbdc1a5
Add oterm to community integrations
2023-10-15 23:21:17 +02:00
Jeffrey Morgan
06bcfbd629
cleanup docker section in readme
2023-10-15 02:33:25 -04:00
Jeffrey Morgan
7d7c2510f8
add docker exec command to readme
2023-10-15 02:31:15 -04:00
Jeffrey Morgan
f9b2f999ac
update readme with docker setup and link to import.md
2023-10-15 02:23:03 -04:00
Jeffrey Morgan
c416087339
import.md: formatting and spelling
2023-10-15 01:39:46 -04:00
Jeffrey Morgan
6002cebd2c
import.md: convert and quantize docs
2023-10-15 00:11:51 -04:00
Jeffrey Morgan
212bdc541c
import.md: model architectures spelling
2023-10-15 00:07:58 -04:00
Jeffrey Morgan
dca6686273
add steps for creating a Modelfile and more example commands to import.md
2023-10-15 00:05:50 -04:00
Jeffrey Morgan
598621afab
add push script for docker images
2023-10-14 14:24:39 -04:00
Matt Williams
6479f49c09
Merge pull request #773 from jmorganca/mattw/howtoquant
...
add how to quantize doc
2023-10-14 08:29:39 -07:00
Matt Williams
b2974a7095
applied Mike's comments
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-14 08:29:24 -07:00
Jeffrey Morgan
832b4db9d4
Use correct url for auto updates
2023-10-13 19:04:42 -04:00
Bruce MacDonald
c43873f33b
check update response (#785)
2023-10-13 18:05:46 -04:00
Michael Yang
11d82d7b9b
update checkvram
2023-10-13 14:47:29 -07:00
Michael Yang
36fe2deebf
only check system memory on macos
2023-10-13 14:47:29 -07:00
Michael Yang
4a8931f634
check total (system + video) memory
2023-10-13 14:47:29 -07:00
Michael Yang
bd6e38fb1a
refactor memory check
2023-10-13 14:47:29 -07:00
Michael Yang
92189a5855
fix memory check
2023-10-13 14:47:29 -07:00
Michael Yang
d790bf9916
Merge pull request #783 from jmorganca/mxyng/fix-gpu-offloading
...
fix: offloading on low end GPUs
2023-10-13 14:36:44 -07:00
Michael Yang
35afac099a
do not use gpu binary when num_gpu == 0
2023-10-13 14:32:12 -07:00
Michael Yang
811c3d1900
no gpu if vram < 2GB
2023-10-13 14:32:12 -07:00
Bruce MacDonald
3553d10769
check for newer updates (#784)
...
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
2023-10-13 17:29:46 -04:00
Bruce MacDonald
6fe178134d
improve api error handling (#781)
...
- remove new lines from llama.cpp error messages relayed to client
- check api option types and return error on wrong type
- change num layers from 95% VRAM to 92% VRAM
2023-10-13 16:57:10 -04:00
Jeffrey Morgan
d890890f66
use lower glibc versions in Dockerfile.build
2023-10-13 01:06:19 -04:00
Jeffrey Morgan
89ba19feca
use Go 1.21.3 in Dockerfile
2023-10-12 23:23:12 -04:00
Jeffrey Morgan
6f58c77671
update Dockerfile.build for linux binary builds
2023-10-12 22:14:20 -04:00
Matt Williams
3c975f898f
update doc to refer to docker image
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-12 15:57:50 -07:00
Matt Williams
9245c8a1df
add how to quantize doc
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-12 15:34:57 -07:00
Michael Yang
7a537cdca9
Merge pull request #770 from jmorganca/mxyng/fix-download
...
fix download
2023-10-12 12:56:43 -07:00
Michael Yang
257ffeb997
fix download
2023-10-12 12:52:43 -07:00
Matt Williams
9b513bb6b1
Merge pull request #753 from jmorganca/mattw/examplereorg
...
rename the examples to be more descriptive
2023-10-12 11:24:12 -07:00
Matt Williams
042100f797
final rename
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-12 11:23:41 -07:00
Bruce MacDonald
7804b8fab9
validate api options fields from map (#711)
2023-10-12 11:18:11 -04:00
Bruce MacDonald
56497663c8
relay model runner error message to client (#720)
...
* give direction to user when runner fails
* also relay errors from timeout
* increase timeout to 3 minutes
2023-10-12 11:16:37 -04:00
Matt Williams
e1afcb8af2
simple gen to simple
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-11 21:29:07 -07:00
Matt Williams
385eeea357
remove with
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-11 21:26:11 -07:00
Matt Williams
8a41b244e8
add golang gen
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-11 21:20:50 -07:00
Jeffrey Morgan
92578798bb
fix relative links in README.md
2023-10-11 19:24:06 -04:00
Michael Yang
788637918a
Merge pull request #760 from jmorganca/mxyng/more-downloads
...
Mxyng/more downloads
2023-10-11 14:33:10 -07:00
Michael Yang
c413a55093
download: handle inner errors
2023-10-11 14:15:30 -07:00
Michael Yang
630bb75d2a
dynamically size download parts based on file size
2023-10-11 14:10:25 -07:00
Michael Yang
a2055a1e93
update download
2023-10-11 14:10:25 -07:00