-
-
Notifications
You must be signed in to change notification settings - Fork 68
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
In-Place Reduction for NCCL #259
Comments
Ah yeah I see that (cuda docs: https://docs.nvidia.com/deeplearning/nccl/user-guide/docs/api/colls.html#c.ncclReduce) I think in this case due to rust's borrow rules it'd probably be easiest to just add |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
NCCL supports all-reduce in place, however
Comm::all_reduce
takes in a&CudaSlice
to read from and a&mut CudaSlice
to write into, which doesn't allow in-place reduction.The text was updated successfully, but these errors were encountered: