This refines the gpu package error handling and fixes a bug with the system memory lookup on windows.
If someone checks out the ollama repo and doesn't install the CUDA library, this will ensure they can build a CPU only version