* gpu: read memory info from all CUDA devices
* add `LOOKUP_SIZE` constant
* better constant name
* address comments