Acknowledge `CUDA_DEVICE_QUERY` during GPU selection #216

denisalevi · 2021-07-14T14:01:15Z

Before, CUDA_DEVICE_QUERY had not effect for GPU selection in brian2cuda. This PR should fix that.

This needs some tinkering around with the different IDs (the ones reported by nvidia-smi, which ignores CUDA_VISIBLE_DEVICES) and the ones defined by CUDA_VISIBLE_DEVICES (which e.g. deviceQuery takes into account).

Here is the updated doc for the gpu_id preferences. This is how it should work. What is missing is that with prefs....gpu_id = None and CUDA_VISIBLE_DEVICES set, it still ignored CUDA_VISIBLE_DEVICES currently.

    The ID of the GPU that should be used for code execution. Default value is
    `None`, in which case the GPU with the highest compute capability and lowest ID
    is used.

    If this preference is set, it has to be the ID reported by `nvidia-smi`, which
    ignores the environment variable `CUDA_VISIBLE_DEVICES`.

    If this preference isn't set, `CUDA_VISIBLE_DEVICES` is not ignored. E.g. with
    `CUDA_DEVICE_QUERY=1,2` only GPUs 1 and 2 will be considered during GPU
    detection.

We use `nvidia-smi -L` to detect all available GPUs. `nvidia-smi` displays all GPUs independent of `CUDA_DEVICE_QUERY`, hence setting it had not effect. Now, `CUDA_DEVICE_QUERY=1` will detect GPU 1 as GPU 0.

This is not implemented yet, but shows how it should work.

CUDA_VISIBLE_DEVICES will now precede any other options. That means if `gpu_id` is set as `prefs`, it will choose from the visible devices.

denisalevi · 2021-07-14T16:44:40Z

I thought cudaSetDevice would ignore CUDA_VISIBLE_DEVICES. But it doesn't. Hence the most obvious and easiest solution here is to never ignore CUDA_VISIBLE_DEVICES. This is implemented now. The only place where this was failing was when detecting all available GPUs using nvidia-smi, which ignores CUDA_VISIBLE_DEVICES. Now, detection of all devices checks CUDA_VISIBLE_DEVICES as well.

Acknowledge CUDA_DEVICE_QUERY during GPU selection

c86b2a6

We use `nvidia-smi -L` to detect all available GPUs. `nvidia-smi` displays all GPUs independent of `CUDA_DEVICE_QUERY`, hence setting it had not effect. Now, `CUDA_DEVICE_QUERY=1` will detect GPU 1 as GPU 0.

denisalevi force-pushed the fix-gpu-detection branch from 3346b2c to c86b2a6 Compare July 14, 2021 14:07

denisalevi added 3 commits July 14, 2021 16:50

Remove whitespace

8ee222e

Update gpu_id prefs documentation

888bd1a

This is not implemented yet, but shows how it should work.

Fix CUDA_VISIBLE_DEVICES behavior

b8f6de7

CUDA_VISIBLE_DEVICES will now precede any other options. That means if `gpu_id` is set as `prefs`, it will choose from the visible devices.

denisalevi merged commit ad6b820 into master Jul 14, 2021

denisalevi deleted the fix-gpu-detection branch July 14, 2021 16:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Acknowledge `CUDA_DEVICE_QUERY` during GPU selection #216

Acknowledge `CUDA_DEVICE_QUERY` during GPU selection #216

denisalevi commented Jul 14, 2021 •

edited

Loading

denisalevi commented Jul 14, 2021

Acknowledge CUDA_DEVICE_QUERY during GPU selection #216

Acknowledge CUDA_DEVICE_QUERY during GPU selection #216

Conversation

denisalevi commented Jul 14, 2021 • edited Loading

denisalevi commented Jul 14, 2021

Acknowledge `CUDA_DEVICE_QUERY` during GPU selection #216

Acknowledge `CUDA_DEVICE_QUERY` during GPU selection #216

denisalevi commented Jul 14, 2021 •

edited

Loading