You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi! I wanted to ask some questions about the current implementation of parallel environments in this framework.
As far as I understand, using ParallelPyEnvironment allows to have several environments running in parallel, each of them collection a trajectory of state/action/reward. What is not clear to me is how this is treated in the optimization part of the agents. Are these trajectories then treated individually by the optimizer? Also, what is the difference between ParallelPyEnvironment and BatchedPyEnvironment?
Hi! I wanted to ask some questions about the current implementation of parallel environments in this framework.
As far as I understand, using
ParallelPyEnvironment
allows to have several environments running in parallel, each of them collection a trajectory of state/action/reward. What is not clear to me is how this is treated in the optimization part of the agents. Are these trajectories then treated individually by the optimizer? Also, what is the difference betweenParallelPyEnvironment
andBatchedPyEnvironment
?cc @jerabaul29 and @franalcantara, feel free to ask/add information here :)
Thanks!
The text was updated successfully, but these errors were encountered: