If we try to load the CUDA library on an old GPU, it panics and crashes the server. This checks the compute capability before we load the library so we can gracefully fall back to CPU mode.
* gpu: read memory info from all cuda devices * add `LOOKUP_SIZE` constant * better constant name * address comments