[RLlib] Add `on_checkpoint_loaded` callback AND also store eval workers' `policy_mapping_fn` in algo state. #40350

sven1977 · 2023-10-14T17:02:24Z

This PR fixes a problem with heavily customized eval WorkerSet setups, policy sets, and mapping functions.

Why are these changes needed?

When a user overrides the on_algorithm_init callback in order to setup special evaluation policies inside the evaluation worker set, including a new eval policy mapping function, then upon restoring this algorithm from a checkpoint, the eval policy_mapping_fn information would be overridden by the main policy_mapping_fn (b/c that one is the only one stored in the checkpoint).

To solve this problem and to add additional handles for users with such complex customization needs, this PR:

Stores the eval workers' policy_mapping_fn in the algorithm state, in case this mapping is different from the main mapping function.
Adds on_checkpoint_loaded() callback called after the Algorithm was restored from a checkpoint (Algorithm.load_checkpoint() has completed).
New test cases have been added for both features.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: sven1977 <[email protected]>

…e59_eval_policy_mapping_fn_not_checkpointed

Signed-off-by: sven1977 <[email protected]>

kouroshHakha · 2023-10-18T15:01:55Z

rllib/algorithms/tests/test_callbacks.py

@@ -74,6 +83,29 @@ def setUpClass(cls):
 def tearDownClass(cls):
 ray.shutdown()

+ def test_on_init_and_checkpoint_loaded(self):
+ config = (
+ PGConfig()


We are moving PG to rllib contrib. use PPO.

Great catch! Will change.

kouroshHakha · 2023-10-18T15:05:27Z

some GPU restoration tests have not passed.

Signed-off-by: sven1977 <[email protected]>

…inted

Signed-off-by: sven1977 <[email protected]>

…ot_checkpointed' into issue59_eval_policy_mapping_fn_not_checkpointed

Signed-off-by: sven1977 <[email protected]>

…rs' `policy_mapping_fn` in algo state. (ray-project#40350)

wip

e57e713

Signed-off-by: sven1977 <[email protected]>

sven1977 requested review from gjoliver, avnishn, ArturNiederfahrenhorst, smorad, maxpumperla, kouroshHakha and krfricke as code owners October 14, 2023 17:02

sven1977 added 3 commits October 14, 2023 20:15

wip

dd08796

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' of https://github.com/ray-project/ray into issu…

0f94056

…e59_eval_policy_mapping_fn_not_checkpointed

wip

7582635

Signed-off-by: sven1977 <[email protected]>

sven1977 assigned kouroshHakha Oct 18, 2023

kouroshHakha approved these changes Oct 18, 2023

View reviewed changes

sven1977 added 7 commits October 19, 2023 09:55

wip

a2e3e4f

Signed-off-by: sven1977 <[email protected]>

wip

3cfd0a1

Signed-off-by: sven1977 <[email protected]>

Merge branch 'master' into issue59_eval_policy_mapping_fn_not_checkpo…

0986683

…inted

wip

234fdc0

Signed-off-by: sven1977 <[email protected]>

Merge remote-tracking branch 'origin/issue59_eval_policy_mapping_fn_n…

34808b5

…ot_checkpointed' into issue59_eval_policy_mapping_fn_not_checkpointed

wip

4f8312f

Signed-off-by: sven1977 <[email protected]>

wip

aa2c4bc

Signed-off-by: sven1977 <[email protected]>

sven1977 merged commit bdc9f83 into ray-project:master Oct 19, 2023
21 of 28 checks passed

rickyyx mentioned this pull request Oct 26, 2023

Release test long_running_many_actor_tasks failed #40568

Closed

jonathan-anyscale pushed a commit to jonathan-anyscale/ray that referenced this pull request Oct 26, 2023

[RLlib] Add on_checkpoint_loaded callback AND also store eval worke…

22cd9fb

…rs' `policy_mapping_fn` in algo state. (ray-project#40350)

jonathan-anyscale pushed a commit to jonathan-anyscale/ray that referenced this pull request Oct 26, 2023

[RLlib] Add on_checkpoint_loaded callback AND also store eval worke…

5314760

…rs' `policy_mapping_fn` in algo state. (ray-project#40350)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Add `on_checkpoint_loaded` callback AND also store eval workers' `policy_mapping_fn` in algo state. #40350

[RLlib] Add `on_checkpoint_loaded` callback AND also store eval workers' `policy_mapping_fn` in algo state. #40350

sven1977 commented Oct 14, 2023 •

edited

Loading

kouroshHakha Oct 18, 2023

sven1977 Oct 18, 2023

sven1977 Oct 19, 2023

kouroshHakha commented Oct 18, 2023

[RLlib] Add on_checkpoint_loaded callback AND also store eval workers' policy_mapping_fn in algo state. #40350

[RLlib] Add on_checkpoint_loaded callback AND also store eval workers' policy_mapping_fn in algo state. #40350

Conversation

sven1977 commented Oct 14, 2023 • edited Loading

Why are these changes needed?

Related issue number

Checks

kouroshHakha Oct 18, 2023

Choose a reason for hiding this comment

sven1977 Oct 18, 2023

Choose a reason for hiding this comment

sven1977 Oct 19, 2023

Choose a reason for hiding this comment

kouroshHakha commented Oct 18, 2023

[RLlib] Add `on_checkpoint_loaded` callback AND also store eval workers' `policy_mapping_fn` in algo state. #40350

[RLlib] Add `on_checkpoint_loaded` callback AND also store eval workers' `policy_mapping_fn` in algo state. #40350

sven1977 commented Oct 14, 2023 •

edited

Loading