Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

prov/opx: Add GDRCopy logging and failure path #10221

Conversation

charlesshereda
Copy link
Contributor

Added messaging and some failures when GDRCopy is/is not being used and is/is not available.

@charlesshereda
Copy link
Contributor Author

Hoping this can still make it into 1.22...


if (gdrcopy_enabled == 1) {
if (use_gdrcopy == 1) {
FI_WARN(&fi_opx_provider, FI_LOG_FABRIC, "GDRCopy has been requested and is available. If you wish to explicity disable GDRCopy, set FI_HMEM_CUDA_USE_GDRCOPY=0\n");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why produce a warning when the result is as expected?

goto err;
} else {
/* gdrcopy_enabled = 0 and use_gdr = 0 */
FI_WARN(&fi_opx_provider, FI_LOG_FABRIC, "If GDRCopy is installed on this system, change FI_HMEM_CUDA_USE_GDRCOPY=0 to FI_HMEM_CUDA_USE_GDRCOPY=1 to enable GDRCopy. \n");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same here, this path is only reached when gdrcopy is explicitly disabled.

@j-xiong
Copy link
Contributor

j-xiong commented Jul 25, 2024

BTW, this won't be able to get into 1.22.0.

@charlesshereda
Copy link
Contributor Author

BTW, this won't be able to get into 1.22.0.

Ok, understood.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants