forked from openai/baselines
ACER performance on Breakout #103
Comments
I experienced this same problem with PPO-1. I ran with verbose=1 and TensorBoard logging enabled, leaving all other parameters at their defaults. After roughly 1.3M frames, the episode reward on Breakout was still stuck around 2.0, which is effectively what a random agent would achieve.
Now merged with master.
Although ACER seems to give good performance on most Atari games, it still fails on Breakout.
Code: https://github.com/araffin/rl-baselines-zoo
We should double-check the implementation.
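
One cheap sanity check, before digging into the algorithm itself, is to measure what a random agent scores and compare the trained agent against that number. Below is a minimal sketch of that comparison, not code from this repository. `ToyEnv`, `mean_episode_reward`, and `beats_random` are hypothetical names made up for illustration, the toy environment only mimics the classic Gym `reset()`/`step()` interface (4-tuple return assumed), and the 1.5x margin is an arbitrary choice.

```python
import random
from statistics import mean


class ToyEnv:
    """Hypothetical stand-in for a Gym-style environment (not Breakout)."""

    def __init__(self, episode_length=10, n_actions=4):
        self.episode_length = episode_length
        self.n_actions = n_actions
        self._t = 0

    def reset(self):
        self._t = 0
        return 0  # dummy observation

    def step(self, action):
        self._t += 1
        # Illustrative reward: 1 only when action 0 is chosen.
        reward = 1.0 if action == 0 else 0.0
        done = self._t >= self.episode_length
        return 0, reward, done, {}


def mean_episode_reward(env, policy, n_episodes=100):
    """Average undiscounted episode return of `policy` on `env`."""
    returns = []
    for _ in range(n_episodes):
        obs = env.reset()
        done, total = False, 0.0
        while not done:
            obs, reward, done, _ = env.step(policy(obs))
            total += reward
        returns.append(total)
    return mean(returns)


def beats_random(agent_reward, random_reward, margin=1.5):
    """True if the agent's mean reward exceeds `margin` times the random baseline."""
    return agent_reward > margin * random_reward


env = ToyEnv()
rng = random.Random(0)  # seeded so the baseline is reproducible

# Baseline: uniformly random actions.
random_reward = mean_episode_reward(env, lambda obs: rng.randrange(env.n_actions))
# Stand-in for a "trained" agent: always picks the rewarded action.
always_zero_reward = mean_episode_reward(env, lambda obs: 0)

print("random baseline:", random_reward)
print("agent:", always_zero_reward)
print("agent beats random:", beats_random(always_zero_reward, random_reward))
```

With a real Atari environment, the same two helper functions could be pointed at the actual environment and the trained model's action function. An agent whose mean reward sits at the random baseline, as with the reward of about 2.0 reported in the comments, would fail this check.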