9929751cc8
Until ROCm v6.2 ships, we won't be able to get accurate free memory reporting on Windows, which makes automatic concurrency too risky there. Users can still opt in, but they will need to pay attention to model sizes; otherwise they may thrash/page VRAM or cause OOM crashes. All other platforms and GPUs now have accurate VRAM reporting wired up, so we can turn on concurrency by default for them. |
||
---|---|---|
auth.go | ||
download.go | ||
fixblobs.go | ||
fixblobs_test.go | ||
images.go | ||
layer.go | ||
manifest.go | ||
manifest_test.go | ||
model.go | ||
modelpath.go | ||
modelpath_test.go | ||
prompt.go | ||
prompt_test.go | ||
routes.go | ||
routes_create_test.go | ||
routes_delete_test.go | ||
routes_list_test.go | ||
routes_test.go | ||
sched.go | ||
sched_test.go | ||
upload.go |
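
The policy described in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the code in `sched.go`: the `gpuInfo` type, the `reliableFreeMemory` and `concurrencyEnabled` functions, and the library strings are all hypothetical names chosen for the example. It assumes that ROCm on Windows is the only combination with unreliable free-memory reporting, and that an explicit user choice always wins over the default.

```go
package main

import "fmt"

// gpuInfo is a hypothetical, simplified description of a detected GPU.
type gpuInfo struct {
	Library string // illustrative values: "cuda", "rocm", "metal"
}

// reliableFreeMemory reports whether free-VRAM numbers can be trusted for the
// given OS and GPU. Assumption taken from the commit message: only ROCm on
// Windows is treated as unreliable.
func reliableFreeMemory(goos string, gpu gpuInfo) bool {
	return !(goos == "windows" && gpu.Library == "rocm")
}

// concurrencyEnabled decides whether concurrency is on. A non-nil userOptIn
// is an explicit user choice and is always honored; otherwise concurrency
// defaults to on only if every GPU reports free memory reliably.
func concurrencyEnabled(goos string, gpus []gpuInfo, userOptIn *bool) bool {
	if userOptIn != nil {
		return *userOptIn
	}
	for _, g := range gpus {
		if !reliableFreeMemory(goos, g) {
			return false
		}
	}
	return true
}

func main() {
	radeon := []gpuInfo{{Library: "rocm"}}
	nvidia := []gpuInfo{{Library: "cuda"}}
	optIn := true

	fmt.Println("windows + rocm, no preference:", concurrencyEnabled("windows", radeon, nil))
	fmt.Println("windows + rocm, user opt-in:  ", concurrencyEnabled("windows", radeon, &optIn))
	fmt.Println("linux + rocm, no preference:  ", concurrencyEnabled("linux", radeon, nil))
	fmt.Println("windows + cuda, no preference:", concurrencyEnabled("windows", nvidia, nil))
}
```

Taking the opt-in as a pointer keeps "no preference" distinct from an explicit "off", so the default can change later without overriding users who made a choice.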