
[autograd] Do not detach when unpacking tensors that do not require grad #127959

Closed
soulitzer wants to merge 23 commits

Conversation

soulitzer (Contributor) commented Jun 4, 2024:

Stack from ghstack (oldest at bottom):

In this PR:

  • Ensure that if a tensor that does not require grad is saved for backward, unpacking it does not trigger a detach (unless the user installs a saved-tensor pack hook that returns a tensor requiring grad). A sketch of this follows the list.
  • Update non-reentrant checkpoint to also no longer detach in this case (a checkpoint sketch follows the Alternatives note below).
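
A minimal sketch of the first bullet's expected behavior, using the public custom autograd Function API; the `saved` list is an illustrative helper for checking object identity and is not part of this PR:

```python
import torch

saved = []  # illustrative helper: record the tensor saved in forward

class Scale(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, w):
        # w does not require grad, but is still saved for backward
        ctx.save_for_backward(w)
        saved.append(w)
        return x * w

    @staticmethod
    def backward(ctx, grad_out):
        (w,) = ctx.saved_tensors  # unpacking happens here
        # With this PR, unpacking a saved tensor that does not require
        # grad should return the original object, not a detached copy
        print("identity preserved:", w is saved[0])  # expected: True
        return grad_out * w, None

x = torch.randn(3, requires_grad=True)
w = torch.randn(3)  # requires_grad=False
Scale.apply(x, w).sum().backward()
```

Per the caveat above, installing a saved-tensor pack hook (e.g. via torch.autograd.graph.saved_tensors_hooks) that returns a tensor requiring grad would still reintroduce the detach on unpack.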

Alternatives:

  • For a custom autograd Function, you could save directly on ctx to work around this, but that would not work once we switch to using custom ops.
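
Similarly, a sketch of the second bullet using only the public non-reentrant checkpoint API; the comments describe the expected behavior rather than asserting it:

```python
import torch
from torch.utils.checkpoint import checkpoint

def fn(x, w):
    return (x * w).relu()

x = torch.randn(3, requires_grad=True)
w = torch.randn(3)  # does not require grad

# With use_reentrant=False, a tensor saved inside the checkpointed
# region that does not require grad (such as w) should now be handed
# back without an extra detach when backward unpacks it.
out = checkpoint(fn, x, w, use_reentrant=False)
out.sum().backward()
```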

pytorch-bot (bot) commented Jun 4, 2024:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/127959

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

✅ You can merge normally! (1 Unrelated Failure)

As of commit 994a888 with merge base e6d4451:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

soulitzer added a commit that referenced this pull request Jun 4, 2024
ghstack-source-id: fd728b4f37088faa03d2c44e3c416558b42045be
Pull Request resolved: #127959
soulitzer added a commit that referenced this pull request Jun 6, 2024
ghstack-source-id: e86d6b9cdee39a442829c51db19f1a444fb95067
Pull Request resolved: #127959
soulitzer added a commit that referenced this pull request Jun 10, 2024
ghstack-source-id: a290a3d1320918b2a2bc9e502db653398bac4c93
Pull Request resolved: #127959
soulitzer requested a review from albanD as a code owner June 11, 2024 13:16
soulitzer added a commit that referenced this pull request Jun 11, 2024
ghstack-source-id: fb514383b45a7bb292683818fca9581d60e7aa24
Pull Request resolved: #127959
soulitzer changed the title from "Support nested tensor with activation checkpoint" to "[checkpoint] Activation checkpoint preserves object identity when tensor does not require grad" on Jun 11, 2024
A review comment was left on this line of the test:

    def fn(values, offsets):

Contributor: Do you mean def fn(values, lengths)?

soulitzer (Author): Yup, good catch.

soulitzer marked this pull request as draft June 12, 2024 15:45
soulitzer changed the title from "[checkpoint] Activation checkpoint preserves object identity when tensor does not require grad" to "[autograd] Do not detach when unpacking tensors that do not require grad" on Jun 12, 2024
soulitzer added a commit that referenced this pull request Jun 18, 2024
ghstack-source-id: bb06e4f94231b2af19ccb88e1b706e2c76c7161f
Pull Request resolved: #127959
soulitzer added a commit that referenced this pull request Jun 21, 2024
ghstack-source-id: 229a838975d0f4fc597745c4f867aa89b96a0d09
Pull Request resolved: #127959
soulitzer added a commit that referenced this pull request Jun 25, 2024
ghstack-source-id: 7fca85d18ff52c390579e906378a9ccfd695fa1c
Pull Request resolved: #127959
soulitzer (Author) commented:

@pytorchbot merge

pytorch-bot added the ciflow/trunk label (Trigger trunk jobs on your pull request) on Jun 27, 2024
pytorchmergebot (Collaborator) commented:

Merge failed

Reason: This PR needs a release notes: label
If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Details for Dev Infra team: raised by workflow job.

soulitzer added the release notes: autograd label (release notes category) on Jun 27, 2024
YuqingJ (Contributor) commented Jul 1, 2024:

Ready to merge again?

soulitzer (Author) commented:

@pytorchbot merge -i

pytorchmergebot (Collaborator) commented:

Merge started

Your change will be merged while ignoring the following 1 checks: pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 5, 5, linux.g5.4xlarge.nvidia.gpu)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

pytorchmergebot pushed a commit to khushi-411/pytorch that referenced this pull request Jul 2, 2024
[autograd] Do not detach when unpacking tensors that do not require grad (pytorch#127959)

Pull Request resolved: pytorch#127959
Approved by: https://github.com/YuqingJ
ghstack dependencies: pytorch#125795, pytorch#128545, pytorch#129262
github-actions bot deleted the gh/soulitzer/306/head branch August 1, 2024 02:01
Labels: ciflow/trunk (Trigger trunk jobs on your pull request), Merged, release notes: autograd (release notes category)