Jeffrey Morgan
92578798bb
fix relative links in README.md
2023-10-11 19:24:06 -04:00
Michael Yang
788637918a
Merge pull request #760 from jmorganca/mxyng/more-downloads
...
Mxyng/more downloads
2023-10-11 14:33:10 -07:00
Michael Yang
c413a55093
download: handle inner errors
2023-10-11 14:15:30 -07:00
Michael Yang
630bb75d2a
dynamically size download parts based on file size
2023-10-11 14:10:25 -07:00
Michael Yang
a2055a1e93
update download
2023-10-11 14:10:25 -07:00
Michael Yang
b599946b74
add format bytes
2023-10-11 14:08:23 -07:00
Michael Yang
aca2d65b82
Merge pull request #757 from jmorganca/mxyng/format-time
...
cleanup format time
2023-10-11 11:12:29 -07:00
Michael Yang
b5e08e3373
cleanup format time
2023-10-11 11:09:27 -07:00
Bruce MacDonald
274d5a5fdf
optional parameter to not stream response ( #639 )
...
* update streaming request accept header
* add optional stream param to request bodies
2023-10-11 12:54:27 -04:00
Matt Williams
fc6b49be32
add ts alternate to python langchain simplegen
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-11 09:50:15 -07:00
Bruce MacDonald
77295f716e
prevent waiting on exited command ( #752 )
...
* prevent waiting on exited command
* close llama runner once
2023-10-11 12:32:13 -04:00
Matt Williams
615f7d1dea
cleanup readme.
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-11 06:13:29 -07:00
Matt Williams
cdf5e106ae
rename dirs
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-11 06:10:24 -07:00
Matt Williams
a85329f59a
rename the models to be more descriptive
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-10 17:40:02 -07:00
Bruce MacDonald
f2ba1311aa
improve vram safety with 5% vram memory buffer ( #724 )
...
* check free memory not total
* wait for subprocess to exit
2023-10-10 16:16:09 -04:00
Jeffrey Morgan
65dcd0ce35
always cleanup blob download ( #747 )
2023-10-10 13:12:29 -04:00
Michael Yang
0040f543a2
Merge pull request #743 from jmorganca/mxyng/http-proxy
...
handle upstream proxies
2023-10-10 09:59:06 -07:00
Matt Williams
767f9bdbbb
Merge pull request #585 from jmorganca/matt/examplementors
...
add the example for ask the mentors
2023-10-09 13:58:14 -07:00
Costa Alexoglou
f7f5169c94
Update api.md ( #741 )
...
Avoid triple ticks in visual editor and also copied in clipboard.
2023-10-09 16:01:46 -04:00
Michael Yang
2cfffea02e
handle client proxy
2023-10-09 12:33:47 -07:00
Michael Yang
f6e98334e4
handle upstream proxies
2023-10-09 11:42:36 -07:00
Jeffrey Morgan
ab0668293c
llm: fix build on amd64
2023-10-06 14:39:54 -07:00
Bruce MacDonald
af4cf55884
not found error before pulling model ( #718 )
2023-10-06 16:06:20 -04:00
Bruce MacDonald
d6786f2945
add feedback for reading model metadata ( #722 )
2023-10-06 16:05:32 -04:00
Michael Yang
38dc2f79bc
Merge pull request #626 from jmorganca/mxyng/concurrent-downloads
...
parallel chunked downloads
2023-10-06 13:01:29 -07:00
Michael Yang
cb961c87ca
Merge pull request #679 from jamesbraza/modelfile-docs
...
`Modelfile` syntax highlighting
2023-10-06 12:59:45 -07:00
Michael Yang
0560b28a8d
names
2023-10-06 12:56:56 -07:00
Michael Yang
10199c5987
replace done channel with file check
2023-10-06 12:56:56 -07:00
Michael Yang
288814d3e4
fix ref counts
2023-10-06 12:56:43 -07:00
Michael Yang
04733438da
check head request response
2023-10-06 12:56:43 -07:00
Michael Yang
711e891f0f
fix resumable downloads
...
glob returns files in lexical order which is not appropriate when
rebuilding the parts list
2023-10-06 12:56:43 -07:00
Michael Yang
090d08422b
handle unexpected eofs
2023-10-06 12:56:43 -07:00
Michael Yang
5b84404c64
handle concurrent requests for the same blobs
2023-10-06 12:56:43 -07:00
Michael Yang
8544edca21
parallel chunked downloads
2023-10-06 12:56:43 -07:00
Bruce MacDonald
5d22319a2c
rename server subprocess ( #700 )
...
- this makes it easier to see that the subprocess is associated with ollama
2023-10-06 10:15:42 -04:00
Bruce MacDonald
2130c0708b
output type parsed from modelfile ( #678 )
2023-10-05 14:58:04 -04:00
Patrick Devine
61ff1946e6
revise help text ( #706 )
2023-10-05 11:36:07 -07:00
Bruce MacDonald
d06bc0cb6e
enable q8, q5, 5_1, and f32 for linux gpu ( #699 )
2023-10-05 12:53:47 -04:00
Alexander F. Rødseth
d104b7e997
Fix go test./... issue: fmt.Println arg list ends with redundant newline ( #705 )
2023-10-05 11:11:04 -04:00
Bruce MacDonald
9e2de1bd2c
increase streaming buffer size ( #692 )
2023-10-04 14:09:00 -04:00
Jeffrey Morgan
dc87e9c9ae
update Dockerfile
to pass GOFLAGS
2023-10-03 07:05:15 -07:00
Michael Yang
367cb68dc1
Merge pull request #686 from jmorganca/mxyng/starcoder
...
decode starcoder
2023-10-02 22:47:19 -07:00
Michael Yang
c02c0cd483
starcoder
2023-10-02 19:56:51 -07:00
Patrick Devine
1852755154
show a default message when license/parameters/system prompt/template aren't specified ( #681 )
2023-10-02 14:34:52 -07:00
James Braza
6f2ce74231
Got rif of all caps to show it can be lower case
2023-10-02 13:54:27 -07:00
James Braza
6edcc5c79f
Using code highlighting syntax around Modelfile
2023-10-02 13:46:05 -07:00
Bruce MacDonald
b1f7123301
clean up num_gpu calculation code ( #673 )
2023-10-02 14:53:42 -04:00
Bruce MacDonald
1fbf3585d6
Relay default values to llama runner ( #672 )
...
* include seed in params for llama.cpp server and remove empty filter for temp
* relay default predict options to llama.cpp
- reorganize options to match predict request for readability
* omit empty stop
---------
Co-authored-by: hallh <hallh@users.noreply.github.com>
2023-10-02 14:53:16 -04:00
Patrick Devine
99d5161e8a
don't wordwrap when stdout is redirected or piped ( #662 )
2023-10-02 11:50:55 -07:00
Michael
ea8380be45
add community project: Chatbot Ollama
...
add community project: Chatbot Ollama by @ivanfioravanti
2023-10-02 09:04:31 -07:00