ollama/server
Daniel Hiltgen d6e3b64582 Fix concurrency for CPU mode
Prior refactoring passes accidentally removed the logic to bypass VRAM
checks for CPU loads.  This adds that back, along with test coverage.

This also moves access to the loaded map in the unit test behind the mutex;
the unguarded access was likely the cause of various test flakes.
2024-04-28 13:42:39 -07:00
auth.go change github.com/jmorganca/ollama to github.com/ollama/ollama (#3347) 2024-03-26 13:04:17 -07:00
download.go server: provide helpful workaround hint when stalling on pull (#3584) 2024-04-10 16:24:37 -07:00
fixblobs.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
fixblobs_test.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
images.go types/model: overhaul Name and Digest types (#3924) 2024-04-26 13:08:32 -07:00
layers.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
manifests.go add .golangci.yaml 2024-01-09 09:36:58 -08:00
modelpath.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
modelpath_test.go use int64 consistently 2023-09-28 11:07:24 -07:00
prompt.go change github.com/jmorganca/ollama to github.com/ollama/ollama (#3347) 2024-03-26 13:04:17 -07:00
prompt_test.go change github.com/jmorganca/ollama to github.com/ollama/ollama (#3347) 2024-03-26 13:04:17 -07:00
routes.go return code 499 when user cancels request while a model is loading (#3955) 2024-04-26 17:38:29 -04:00
routes_test.go cgo quantize 2024-04-08 15:31:08 -07:00
sched.go Fix concurrency for CPU mode 2024-04-28 13:42:39 -07:00
sched_test.go Fix concurrency for CPU mode 2024-04-28 13:42:39 -07:00
upload.go change github.com/jmorganca/ollama to github.com/ollama/ollama (#3347) 2024-03-26 13:04:17 -07:00