ollama/gpu
Daniel Hiltgen cd5c8f6471
Optimize container images for startup (#6547)
* Optimize container images for startup

This change adjusts how to handle runner payloads to support
container builds where we keep them extracted in the filesystem.
This makes it easier to optimize the cpu/cuda vs cpu/rocm images for
size, and should result in faster startup times for container images.

* Refactor payload logic and add buildx support for faster builds

* Move payloads around

* Review comments

* Converge to buildx based helper scripts

* Use docker buildx action for release
2024-09-12 12:10:30 -07:00
..
amd_common.go Move ollama executable out of bin dir (#6535) 2024-08-27 16:19:00 -07:00
amd_hip_windows.go lint 2024-08-01 17:06:06 -07:00
amd_linux.go Verify permissions for AMD GPU (#6736) 2024-09-11 11:38:25 -07:00
amd_windows.go Move ollama executable out of bin dir (#6535) 2024-08-27 16:19:00 -07:00
cpu_common.go Implement linux NUMA detection 2024-08-05 12:56:20 -07:00
cuda_common.go Use cuda v11 for driver 525 and older (#6620) 2024-09-03 17:15:31 -07:00
gpu.go Optimize container images for startup (#6547) 2024-09-12 12:10:30 -07:00
gpu_darwin.go Add Jetson cuda variants for arm 2024-08-19 09:38:53 -07:00
gpu_info.h lint 2024-08-01 17:06:06 -07:00
gpu_info_cudart.c Fix bad symbol load detection 2024-06-19 08:39:07 -07:00
gpu_info_cudart.h Refine GPU discovery to bootstrap once 2024-06-14 14:51:40 -07:00
gpu_info_darwin.h gpu: report system free memory instead of 0 (#5521) 2024-07-06 19:35:04 -04:00
gpu_info_darwin.m gpu: report system free memory instead of 0 (#5521) 2024-07-06 19:35:04 -04:00
gpu_info_nvcuda.c Better nvidia GPU discovery logging 2024-07-03 10:50:40 -07:00
gpu_info_nvcuda.h Better nvidia GPU discovery logging 2024-07-03 10:50:40 -07:00
gpu_info_nvml.c Fix bad symbol load detection 2024-06-19 08:39:07 -07:00
gpu_info_nvml.h Reintroduce nvidia nvml library for windows 2024-06-14 14:51:40 -07:00
gpu_info_oneapi.c get real func ptr. 2024-06-19 09:00:51 +08:00
gpu_info_oneapi.h review comments and coverage 2024-06-14 14:55:50 -07:00
gpu_linux.go Refactor linux packaging 2024-08-19 09:38:53 -07:00
gpu_oneapi.go support ollama run on Intel GPUs 2024-05-24 11:18:27 +08:00
gpu_test.go gpu: Group GPU Library sets by variant (#6483) 2024-08-23 15:11:56 -07:00
gpu_windows.go lint 2024-08-01 17:06:06 -07:00
types.go gpu: Group GPU Library sets by variant (#6483) 2024-08-23 15:11:56 -07:00