[RLlib; Documentation] Some docstring cleanups; Rename RemoteVectorEnv into RemoteBaseEnv for clarity. #20250
Conversation
1 question and 2 minor comments. Should be quick.
```python
while not ready:
    ready, _ = ray.wait(
        list(self.pending),
        num_returns=len(self.pending),
        timeout=self.poll_timeout)
```
This seems to be waiting for everything to be ready?
Looking at the code below, I think we don't need the while loop here. Everything assumes ready will contain the full list of obj_refs in one shot.
I don't think that's true. In my understanding, ray.wait returns a tuple: 1) a list of ready handles and 2) a list of not-yet-ready handles. As soon as at least one handle is ready, we continue and return observations only from those ready ray.remote sub-environments.
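For reference, here's a minimal, self-contained sketch of the ray.wait() semantics described above (the toy task and the timeout value are made up purely for illustration):

```python
import time
import ray

ray.init()

@ray.remote
def slow_task(i):
    time.sleep(i)
    return i

refs = [slow_task.remote(i) for i in range(4)]

# ray.wait() returns a tuple: (ready refs, not-yet-ready refs). With
# num_returns=len(refs) plus a timeout, it returns as soon as either all
# refs are ready or the timeout expires, whichever happens first.
ready, not_ready = ray.wait(refs, num_returns=len(refs), timeout=1.5)
print(len(ready), len(not_ready))  # here: 2 ready, 2 still pending
```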
Ok, yeah, that's the self.pending thing, I guess. Let me double-check the semantics of num_returns.
In any case, I think we should get rid of the while loop and the timeout parameter; that would be the same thing, right?
Ok, I verified a couple of things and understand this completely now :)
Normally, num_returns=len(self.pending) would wait until all pending actors come back. However, we also have timeout=self.poll_timeout there, so effectively the logic is (see the sketch after this comment):
- We wait up to self.poll_timeout; if all workers come back with a batch in time, great, and we continue.
- If some workers are slow and don't come back within poll_timeout, we leave them behind and continue, so we never wait longer than one poll_timeout interval.
- If all workers are slow and nothing comes back within poll_timeout, we ignore poll_timeout and keep waiting until at least something comes back, because of the while loop. This may or may not be intended behavior, actually, for off-policy cases?
As we discussed offline, let's just add a comment about this behavior and leave it as is for now. This sounds like a performance-impacting change to me, and we should do it in a separate PR.
I am also thinking: if we are ok with getting data back from only some of the workers, then users don't really need to specify this parameter at all, which would be one fewer confusing parameter for them :)
But yeah, we can come back to this. Thanks :)
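To make the three cases concrete, here's a sketch of the loop's behavior (paraphrasing the diff above; not the exact RLlib code):

```python
import ray

def poll(pending, poll_timeout):
    # Case 1: all refs finish within poll_timeout -> a single ray.wait()
    #         call returns everything and we exit immediately.
    # Case 2: only some refs finish within poll_timeout -> `ready` is
    #         non-empty, so we exit and leave the slow refs pending.
    # Case 3: nothing finishes within poll_timeout -> `ready` is empty and
    #         the while loop keeps calling ray.wait() until at least one
    #         ref comes back (i.e. the timeout is effectively ignored).
    ready = []
    while not ready:
        ready, _ = ray.wait(
            list(pending),
            num_returns=len(pending),
            timeout=poll_timeout)
    return ready
```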
1., 2., and 3.: Yeah, that's how I understand it now, too. So removing the timeout would actually alter the logic here.
```python
else:
    ob = {_DUMMY_AGENT_ID: ret}

if rew is None:
```
Is this a fix for a specific env/version, or a general safety net for users' custom envs?
It feels like this may be concealing bugs, e.g. if a user forgot to return a reward.
Great question. It's for when we have a return value from reset() (rather than step()). reset() does not return rewards (it has only 1 return value; step() has 4 return values).
Added a comment explaining this.
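For context, a hypothetical helper illustrating the check (normalize_result and the 0.0 default are my own naming/assumptions; the old gym-style env API is what's assumed here):

```python
def normalize_result(ret, rew):
    # Under the (old) gym-style API assumed here:
    #   obs = env.reset()                   # 1 return value, no reward
    #   obs, rew, done, info = env.step(a)  # 4 return values
    # A `None` reward therefore means `ret` came from reset(), not from a
    # step() that forgot to return a reward.
    if rew is None:
        rew = 0.0  # Assumed default: no reward on reset.
    return ret, rew
```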
rllib/env/remote_base_env.py
```python
self.poll_timeout = remote_env_batch_wait_ms / 1000

self.actors = None  # lazy init
self.pending = None  # lazy init
```
Any chance you can explain the role of self.pending a bit here?
I found it to be quite important; it actually carries state between member functions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added a comment.
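For readers following along, a hedged mini-sketch of the role self.pending plays, as I understand it from the diff (the class and method names here are illustrative, not the actual RLlib code):

```python
import ray

class RemoteEnvPoller:
    """Illustrative only: `pending` maps each in-flight object ref to the
    sub-env actor that produced it, carrying the 'which actor is still
    busy' state between poll() and send_actions()."""

    def __init__(self, actors, poll_timeout):
        self.actors = actors
        self.poll_timeout = poll_timeout
        # Kick off one reset() per actor; the refs become the dict keys.
        self.pending = {a.reset.remote(): a for a in actors}

    def poll(self):
        ready, _ = ray.wait(
            list(self.pending),
            num_returns=len(self.pending),
            timeout=self.poll_timeout)
        # Pop finished refs: those actors are free to receive new actions.
        return {self.pending.pop(ref): ray.get(ref) for ref in ready}

    def send_actions(self, actions_by_actor):
        for actor, action in actions_by_actor.items():
            # Each new step() call puts an in-flight ref back into pending.
            self.pending[actor.step.remote(action)] = actor
```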
looks great, thanks for doing this man!!
Why are these changes needed?
Related issue number
Checks
I've run scripts/format.sh to lint the changes in this PR.