forked from EleutherAI/gpt-neox
-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The .grad attribute of a Tensor that is not a leaf Tensor is being accessed #16
Milestone
Comments
The reason the warning shows up is because images are floating point and deepspeed sets requires_grad to True for input images: https://github.com/EleutherAI/DeeperSpeed/blob/main/deepspeed/runtime/pipe/engine.py#L764 We can safely ignore this :) Though this might lead to high memory consumption, will link required deepspeed PR here to avoid excessive memory consumption. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Description
When training with what ever config, get warnning as
This indicate that some non-leaf tensor is being accessed.
there is no such warnning in pure gpt-neox, and it still occur when I set add_adapter=False, therefore related to image_prefix at least.
The text was updated successfully, but these errors were encountered: