[RLlib] Change default framework from tf to torch #33604

kouroshHakha · 2023-03-22T21:48:51Z

Why are these changes needed?

This PR changes the default framework_str from tf to either torch or tf2. First step towards hopefully deprecating tf1 stack.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

sven1977 · 2023-03-22T22:27:00Z

rllib/algorithms/algorithm_config.py

@@ -261,7 +261,7 @@ def __init__(self, algo_class=None):
 self.placement_strategy = "PACK"

 # `self.framework()`
- self.framework_str = "tf"
+ self.framework_str = "torch"


:fingers-crossed: :)

sven1977

Looks great! Some example scripts used to run only on tf and will now only run on torch, but I guess that's ok. E.g. custom_metrics_and_callbacks.

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

gjoliver · 2023-03-24T17:19:04Z

rllib/examples/documentation/saving_and_loading_algos_and_policies.py

@@ -297,6 +298,6 @@ def new_policy_mapping_fn(agent_id, episode, worker, **kwargs):

 # __export-models-as-onnx-begin__
 # Using the same Policy object, we can also export our NN Model in the ONNX format:
-ppo_policy.export_model("/tmp/my_nn_model", onnx=True)
+ppo_policy.export_model("/tmp/my_nn_model", onnx=False)


update comment?

gjoliver · 2023-03-24T17:19:29Z

rllib/BUILD

@@ -1618,7 +1618,7 @@ py_test(
 py_test(
 name = "connectors/tests/test_agent",
 tags = ["team:rllib", "connector"],
- size = "small",
+ size = "medium",


this is from some other pr right?

The other PR will get merged and this difference will go away. I wanted to make sure the tests on CI doesn't get red b/c of time outs.

gjoliver · 2023-03-24T17:19:54Z

rllib/env/tests/test_multi_agent_env.py

@@ -449,6 +449,7 @@ def compute_actions_from_input_dict(
 env_creator=lambda _: MultiAgentCartPole({"num_agents": 2}),
 default_policy_class=ModelBasedPolicy,
 config=DQNConfig()
+ .framework("tf")


curious, multi-agent env doesn't work with torch?

It does. This test is overfitted to tf.

gjoliver · 2023-03-24T18:08:21Z

ok ok

* changed default in algo config * implicitly added tf framework to the test scripts Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

* changed default in algo config * implicitly added tf framework to the test scripts Signed-off-by: Kourosh Hakhamaneshi <[email protected]> Signed-off-by: bhuang <[email protected]>

* changed default in algo config * implicitly added tf framework to the test scripts Signed-off-by: Kourosh Hakhamaneshi <[email protected]> Signed-off-by: Jonathan Carter <[email protected]>

* changed default in algo config * implicitly added tf framework to the test scripts Signed-off-by: Kourosh Hakhamaneshi <[email protected]> Signed-off-by: elliottower <[email protected]>

* changed default in algo config * implicitly added tf framework to the test scripts Signed-off-by: Kourosh Hakhamaneshi <[email protected]> Signed-off-by: Jack He <[email protected]>

kouroshHakha added 3 commits March 22, 2023 14:34

changed default in algo config

315d391

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

implicitly added tf framework to the test scripts

b663185

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

changed script defaults from tf to either tf2 or torch

0f587cb

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

kouroshHakha requested review from sven1977, gjoliver, avnishn, ArturNiederfahrenhorst, smorad, maxpumperla and krfricke as code owners March 22, 2023 21:48

changed framework in yaml files from tf to torch or tf2

7e2888b

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

kouroshHakha assigned sven1977, avnishn, ArturNiederfahrenhorst and gjoliver Mar 22, 2023

left one out

96145c7

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

sven1977 reviewed Mar 22, 2023

View reviewed changes

sven1977 approved these changes Mar 22, 2023

View reviewed changes

kouroshHakha added 6 commits March 22, 2023 16:46

test_models fixed

4e34397

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

fixed the failing tests

54add73

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

Merge branch 'master' into change-default-from-tf-to-torch

55862a8

fixed two_trainer_workflow

a1f78e2

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

lint

f3a3ae8

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

wip

19e60f6

Signed-off-by: Kourosh Hakhamaneshi <[email protected]>

gjoliver reviewed Mar 24, 2023

View reviewed changes

gjoliver merged commit 8d2dc9a into ray-project:master Mar 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Change default framework from tf to torch #33604

[RLlib] Change default framework from tf to torch #33604

kouroshHakha commented Mar 22, 2023

sven1977 Mar 22, 2023

sven1977 left a comment

gjoliver Mar 24, 2023

gjoliver Mar 24, 2023

kouroshHakha Mar 24, 2023

gjoliver Mar 24, 2023

kouroshHakha Mar 24, 2023

gjoliver commented Mar 24, 2023

[RLlib] Change default framework from tf to torch #33604

[RLlib] Change default framework from tf to torch #33604

Conversation

kouroshHakha commented Mar 22, 2023

Why are these changes needed?

Related issue number

Checks

sven1977 Mar 22, 2023

Choose a reason for hiding this comment

sven1977 left a comment

Choose a reason for hiding this comment

gjoliver Mar 24, 2023

Choose a reason for hiding this comment

gjoliver Mar 24, 2023

Choose a reason for hiding this comment

kouroshHakha Mar 24, 2023

Choose a reason for hiding this comment

gjoliver Mar 24, 2023

Choose a reason for hiding this comment

kouroshHakha Mar 24, 2023

Choose a reason for hiding this comment

gjoliver commented Mar 24, 2023