CUBLAS: nrm2 support for StridedCuArray with length requiring Int64 #2268

Closed
tipfom opened this issue Feb 15, 2024 · 1 comment · Fixed by #2270
Labels
cuda libraries (Stuff about CUDA library wrappers), enhancement (New feature or request)

Comments

tipfom commented Feb 15, 2024

Currently, the CUDA.CUBLAS.nrm2 wrapper function fails when the length of the array exceeds the largest value representable as an Int32.
This in turn causes the LinearAlgebra.norm implementation to fail with the following error: "InexactError: trunc(Int32, 4294967296)".
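
The error quoted above can be reproduced without a GPU. The following is only a minimal sketch of the integer conversion that fails, assuming the wrapper converts the array length to Int32 before calling into CUBLAS; it is not the wrapper code itself.

```julia
# An array length of 2^32, the value from the error message above.
n = 4294967296

# Int32 can hold values up to 2^31 - 1 only.
println(typemax(Int32))  # 2147483647

# Converting such a length to Int32 throws instead of silently wrapping around.
err = try
    convert(Int32, n)
catch e
    e
end
println(err isa InexactError)  # true
```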

For me, extending the nrm2 wrapper definition to include the 64-bit variants of the nrm2 functions, namely cublasDnrm2_v2_64, cublasSnrm2_v2_64, cublasDznrm2_v2_64, and cublasScnrm2_v2_64, looks like the best solution.
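
As a rough illustration of the idea, the sketch below picks a routine name based on the element type and the array length. The helper `nrm2_routine_name` and the table `NRM2_PREFIX` are hypothetical names made up for this example and are not part of CUDA.jl; the snippet only builds the name as a string and does not call CUBLAS.

```julia
# Element type => routine prefix, following the function names listed above.
const NRM2_PREFIX = Dict(
    Float32    => "S",
    Float64    => "D",
    ComplexF32 => "Sc",
    ComplexF64 => "Dz",
)

# Hypothetical helper (not part of CUDA.jl): choose the 64-bit variant
# only when the length does not fit in an Int32.
function nrm2_routine_name(::Type{T}, n::Integer) where {T}
    base = "cublas" * NRM2_PREFIX[T] * "nrm2_v2"
    return n > typemax(Int32) ? base * "_64" : base
end

println(nrm2_routine_name(Float64, 1_000))  # cublasDnrm2_v2
println(nrm2_routine_name(Float64, 2^32))   # cublasDnrm2_v2_64
```

Whether to always call the 64-bit variant or to switch on the length like this is a design choice left open here.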

tipfom added the enhancement (New feature or request) label Feb 15, 2024

tipfom commented Feb 15, 2024

The PR #2269 provides a suggested fix :)

maleadt added the cuda libraries (Stuff about CUDA library wrappers) label Feb 28, 2024