ollama/server
Jeffrey Morgan 38255d2af1
Use flash attention flag for now (#4580)
* put flash attention behind flag for now

* add test

* remove print

* up timeout for scheduler tests
2024-05-22 21:52:09 -07:00
envconfig Use flash attention flag for now (#4580) 2024-05-22 21:52:09 -07:00
auth.go Revert "use post token" 2024-05-11 22:19:14 -07:00
download.go return on part done 2024-05-14 13:04:30 -07:00
fixblobs.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
fixblobs_test.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
images.go fix quantize file types 2024-05-20 15:22:11 -07:00
layer.go cache and reuse intermediate blobs 2024-05-20 13:25:10 -07:00
manifest.go update list handler to use model.Name 2024-05-07 09:38:45 -07:00
model.go tidy intermediate blobs 2024-05-20 15:15:06 -07:00
modelpath.go prune partial downloads (#4272) 2024-05-09 16:35:20 -07:00
modelpath_test.go validate the format of the digest when getting the model path (#4175) 2024-05-05 11:46:12 -07:00
prompt.go change github.com/jmorganca/ollama to github.com/ollama/ollama (#3347) 2024-03-26 13:04:17 -07:00
prompt_test.go change github.com/jmorganca/ollama to github.com/ollama/ollama (#3347) 2024-03-26 13:04:17 -07:00
routes.go tidy intermediate blobs 2024-05-20 15:15:06 -07:00
routes_test.go Move the parser back + handle utf16 files (#4533) 2024-05-20 11:26:45 -07:00
sched.go Correct typo in error message (#4535) 2024-05-21 13:39:01 -07:00
sched_test.go Use flash attention flag for now (#4580) 2024-05-22 21:52:09 -07:00
upload.go change github.com/jmorganca/ollama to github.com/ollama/ollama (#3347) 2024-03-26 13:04:17 -07:00