[BugFix] `RewardSum` key check #1718

matteobettini · 2023-11-28T10:44:38Z

In the RewardSum transform, reset keys where checked to be valid only when not directly provided by the user.

rl/torchrl/envs/transforms/transforms.py

Line 4773 in 07fcfb1

if len(reset_keys) != len(self.in_keys) or not _check_match(

This led to silent errors when the users provides reset keys manually but there are not as many as the in_keys gathered automatically from the reward_spec.

This PR runs the key check in both the case of user provided or autoimatically gathered reset keys, making the function more resilient.

pytorch-bot · 2023-11-28T10:44:41Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/1718

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (8 Unrelated Failures)

As of commit 290ba1e with merge base 07fcfb1 ():

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens · 2023-11-28T10:55:05Z

Is this needed?

If we look at the message in the error, we specifically tell the user that we cannot retrieve them automatically and hence the user should pass them.
If we pass them manually, the check should be different, e.g., we could just test that there are as many in_keys as reset_keys.

matteobettini · 2023-11-28T10:58:40Z

I think having both checks is useful and at worst it cannot hurt.
The second part of the check will help preventing the user to pass "("smth","_reset") when the in_key is "reward"

The checks are run just once

vmoens · 2023-11-28T11:00:32Z

The current check makes sure that the root of in_keys has a similar root for resets. If resets are passed manually, they could be located anywhere IMO, not at the sample place as in_keys. I guess we want manual resets to be as flexible as possible. If we restrict them as we do for automatically gathered reset entries, what do we gain with the manual version?

matteobettini · 2023-11-28T11:02:43Z

I thought the manual version was just for when the auto does not work, but i see the point.
I ll update to just check the length of the key list in both cases as what to i want to avoid is the zip silently failing due to diff lenghts (which just happened to me)

vmoens · 2023-11-29T07:53:04Z

At this point we need a better set of tests for these error messages! It isn't easy to make sure we get them right just by looking at the code

This reverts commit b935c89.

matteobettini · 2023-11-29T09:14:52Z

I have added a test that would fail on main that checks all the errors related to the length of keys in rewardsum

vmoens · 2023-11-29T13:38:26Z

What do you mean "that would fail on main"? In the sense that they test the new feature or that the feature is bc breaking?

vmoens

LGTM thanks

matteobettini · 2023-11-29T13:41:10Z

What do you mean "that would fail on main"? In the sense that they test the new feature or that the feature is bc breaking?

They test that where before there was a silent error, now there is an explicit exeption.

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 28, 2023

vmoens added the bug Something isn't working label Nov 28, 2023

amend

b935c89

matteobettini force-pushed the error_reward_sum branch from ee81111 to b935c89 Compare November 28, 2023 11:06

matteobettini added 3 commits November 29, 2023 08:33

Revert "amend"

4a5eb0c

This reverts commit b935c89.

amend

f3a3a4f

amend

290ba1e

vmoens approved these changes Nov 29, 2023

View reviewed changes

vmoens merged commit 2e7f574 into pytorch:main Nov 29, 2023
53 of 61 checks passed

matteobettini deleted the error_reward_sum branch December 4, 2023 11:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BugFix] `RewardSum` key check #1718

[BugFix] `RewardSum` key check #1718

matteobettini commented Nov 28, 2023

pytorch-bot bot commented Nov 28, 2023 •

edited

Loading

vmoens commented Nov 28, 2023

matteobettini commented Nov 28, 2023

vmoens commented Nov 28, 2023

matteobettini commented Nov 28, 2023 •

edited

Loading

vmoens commented Nov 29, 2023

matteobettini commented Nov 29, 2023 •

edited

Loading

vmoens commented Nov 29, 2023

vmoens left a comment

matteobettini commented Nov 29, 2023 •

edited

Loading

[BugFix] RewardSum key check #1718

[BugFix] RewardSum key check #1718

Conversation

matteobettini commented Nov 28, 2023

pytorch-bot bot commented Nov 28, 2023 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/1718

✅ You can merge normally! (8 Unrelated Failures)

vmoens commented Nov 28, 2023

matteobettini commented Nov 28, 2023

vmoens commented Nov 28, 2023

matteobettini commented Nov 28, 2023 • edited Loading

vmoens commented Nov 29, 2023

matteobettini commented Nov 29, 2023 • edited Loading

vmoens commented Nov 29, 2023

vmoens left a comment

Choose a reason for hiding this comment

matteobettini commented Nov 29, 2023 • edited Loading

[BugFix] `RewardSum` key check #1718

[BugFix] `RewardSum` key check #1718

pytorch-bot bot commented Nov 28, 2023 •

edited

Loading

matteobettini commented Nov 28, 2023 •

edited

Loading

matteobettini commented Nov 29, 2023 •

edited

Loading

matteobettini commented Nov 29, 2023 •

edited

Loading