Jeffrey Morgan
a7dad24d92
add error for falcon
and starcoder
vocab compatibility ( #844 )
...
add error for falcon and starcoder vocab compatibility
---------
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
2023-10-19 12:18:31 -04:00
Jeffrey Morgan
6b213216d5
Update import.md
2023-10-19 12:17:36 -04:00
Bruce MacDonald
fe6f3b48f7
do not reload the running llm when runtime params change ( #840 )
...
- only reload the running llm if the model has changed, or the options for loading the running model have changed
- rename loaded llm to runner to differentiate from loaded model image
- remove logic which keeps the first system prompt in the generation context
2023-10-19 10:39:58 -04:00
Michael Yang
36c88cb9db
cmd: set ExactArgs
2023-10-18 14:40:48 -07:00
Michael Yang
235e43d7f6
Merge pull request #833 from discovertomorrow/leadingspace
...
Fix Issue with Leading Whitespaces in Decoded Context
2023-10-18 13:52:48 -07:00
Arne Müller
730996e530
use TrimPrefix instead of TrimLeft
2023-10-18 22:51:30 +02:00
Arne Müller
ce6197a8e0
removed redundant strings.CutPrefix from Decode
2023-10-18 22:47:20 +02:00
Arne Müller
46b9953f32
use strings.TrimLeft to remove spaces
2023-10-18 22:41:19 +02:00
Michael Yang
4dcceeffb7
let the template do the work
2023-10-18 13:12:00 -07:00
Michael Yang
019e4a4558
image: show parameters
2023-10-18 13:12:00 -07:00
Michael Yang
627d04d927
Merge pull request #827 from jmorganca/mxyng/template-adapters
...
model: native gotemplate adapter template
2023-10-18 13:11:25 -07:00
Michael Yang
940e8ebec3
Merge pull request #826 from jmorganca/mxyng/template-system
...
show: no template system if empty
2023-10-18 13:11:09 -07:00
Bruce MacDonald
565648f3f7
relay CUDA errors to the client ( #825 )
2023-10-18 15:36:56 -04:00
Arne Müller
90c49bed57
moved removal of leading space into Predict
2023-10-18 20:08:26 +02:00
Michael Yang
3a2477174f
Merge pull request #822 from ggozad/fix-tags-api
...
Fix /api/tags for no models.
2023-10-18 09:34:00 -07:00
Yiorgis Gozadinos
8c6c2cbc8c
When the .ollama folder is broken or there are no models return an empty list on /api/tags
2023-10-18 08:23:20 +02:00
Arne Müller
5dc0cff459
fix whitespace removal
2023-10-18 08:15:27 +02:00
Matt Williams
c5c8b4b16a
added python rag news summary
...
Signed-off-by: Matt Williams <m@technovangelist.com>
2023-10-17 16:41:28 -07:00
Michael Yang
8299bf76ed
model: native gotemplate adapter template
2023-10-17 15:28:38 -07:00
Michael Yang
ee4979e510
show: no template system if empty
2023-10-17 15:25:43 -07:00
Michael Yang
08b0e04f40
Merge pull request #813 from jmorganca/mxyng/llama
...
refactor llm/llama.go
2023-10-17 14:05:58 -07:00
Michael Yang
b36b0b71f8
use cut prefix
2023-10-17 14:01:39 -07:00
Michael Yang
094df37563
remove unused struct
2023-10-17 14:01:38 -07:00
Bruce MacDonald
f3648fd206
Update llama.cpp gguf to latest ( #710 )
2023-10-17 16:55:16 -04:00
Bruce MacDonald
bd93a94abd
fix MB VRAM log output ( #824 )
2023-10-17 15:35:16 -04:00
Michael Yang
f55bdb6f10
Merge pull request #799 from deichbewohner/jsonmarshaling
...
Fix JSON Marshal Escaping for Special Characters
2023-10-17 08:46:02 -07:00
Michael Yang
2870a9bfc8
Merge pull request #812 from jmorganca/mxyng/fix-format-string
...
fix: wrong format string type
2023-10-17 08:40:49 -07:00
Michael Yang
c031c211d1
Merge pull request #809 from jmorganca/mxyng/fix-gpu
...
fix: regression unsupported metal types
2023-10-17 08:40:40 -07:00
Andreas Wäscher
68391b0055
Add OllamaSharp for .NET ( #811 )
2023-10-17 11:31:48 -04:00
Alexander F. Rødseth
b7e137323a
Fix a typo ( #818 )
2023-10-17 09:00:15 -04:00
Arne Müller
8fa3f366ad
Removed newline trimming and used buffer directly in POST request.
2023-10-17 08:17:35 +02:00
Michael Yang
fddb303f23
fix: format string wrong type
2023-10-16 16:14:28 -07:00
Michael Yang
ad5ee20c7b
Merge pull request #794 from ggozad/add_oterm
...
Add oterm to community integrations
2023-10-16 15:51:55 -07:00
Michael Yang
785b4eb5bf
Merge branch 'main' into add_oterm
2023-10-16 15:51:44 -07:00
Michael Yang
16ede1b30b
Merge pull request #801 from s-kostyaev/add-ellama-community-integration
...
Add ellama community integration
2023-10-16 15:51:25 -07:00
Michael Yang
17d6bbbb2a
Merge pull request #810 from vieux/patch-1
...
Update install.sh
2023-10-16 15:50:57 -07:00
Victor Vieux
6481b7f34c
Update install.sh, avoid ARCH: unbound variable
2023-10-16 14:40:24 -07:00
Michael Yang
cb4a80b693
fix: regression unsupported metal types
...
omitting `--n-gpu-layers` means use metal on macos which isn't correct
since ollama uses `num_gpu=0` to explicitly disable gpu for file types
that are not implemented in metal
2023-10-16 14:37:20 -07:00
Bruce MacDonald
68d7255bd3
show request to server rather than local check ( #778 )
2023-10-16 17:27:25 -04:00
Michael Yang
9ef2fce33a
Merge pull request #768 from jmorganca/mxyng/bytes
...
fix memory check
2023-10-16 12:42:41 -07:00
Michael Yang
43eaba3d60
Merge pull request #787 from jmorganca/mxyng/server-version2
...
server: print version on start
2023-10-16 09:59:30 -07:00
Michael Yang
1af493c5a0
server: print version on start
2023-10-16 09:59:14 -07:00
Bruce MacDonald
a0c3e989de
deprecate modelfile embed command ( #759 )
2023-10-16 11:07:37 -04:00
Sergey Kostyaev
7af0fdce48
add ellama community integration
2023-10-16 16:39:10 +07:00
Arne Müller
ee94693b1a
handling unescaped json marshaling
2023-10-16 11:15:55 +02:00
Yiorgis Gozadinos
731dbdc1a5
Add oterm to community integrations
2023-10-15 23:21:17 +02:00
Jeffrey Morgan
06bcfbd629
cleanup docker section in readme
2023-10-15 02:33:25 -04:00
Jeffrey Morgan
7d7c2510f8
add docker exec
command to readme
2023-10-15 02:31:15 -04:00
Jeffrey Morgan
f9b2f999ac
update readme with docker
setup and link to import.md
2023-10-15 02:23:03 -04:00
Jeffrey Morgan
c416087339
import.md
: formatting and spelling
2023-10-15 01:39:46 -04:00