ACER performance on Breakout #103

Open
araffin opened this issue Nov 28, 2018 · 3 comments


araffin commented Nov 28, 2018

Although ACER seems to perform well on most Atari games,
it still fails on Breakout.
Code: https://github.com/araffin/rl-baselines-zoo

python train.py --algo acer --env BreakoutNoFrameskip-v4

We should double-check the implementation.

@Migdalin

I experienced the same problem with PPO1. I ran with verbose=1 and TensorBoard logging on, leaving all other parameters at their defaults. After about 1.3M frames, the episode reward for Breakout was still stuck around 2.0, which is effectively what a random agent would produce.
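For anyone wanting to check a run against that observation, here is a minimal sketch in plain Python. It only uses the ~2.0 random-agent figure quoted above; the function names, the `margin` and the `window` are hypothetical choices for illustration, not something taken from the library.

```python
from statistics import mean


def rolling_mean(rewards, window=100):
    """Mean of the last `window` episode rewards (or of all of them, if fewer)."""
    if not rewards:
        raise ValueError("need at least one episode reward")
    return mean(rewards[-window:])


def looks_random(rewards, random_baseline=2.0, margin=1.0, window=100):
    """True if the recent average reward is within `margin` of the random baseline.

    `random_baseline=2.0` is the Breakout figure reported in this thread;
    `margin` is an arbitrary tolerance (assumption).
    """
    return rolling_mean(rewards, window) <= random_baseline + margin


# Made-up episode rewards, for illustration only.
stuck = [1.0, 2.0, 3.0, 2.0, 2.0]
learning = [2.0, 5.0, 9.0, 14.0, 20.0]

print(looks_random(stuck))
print(looks_random(learning))
```

In practice the reward list would come from whatever episode logging the run produces; a single noisy episode should not be read as progress, hence the averaging window.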


araffin commented Jul 2, 2019

@Migdalin Could you try again with the fix from #388?
In any case, I would recommend using PPO2, which has additional tricks compared to PPO1.
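A rough sketch of what trying PPO2 on Breakout could look like, assuming the stable-baselines 2.x Python API (`PPO2`, `make_atari_env`, `VecFrameStack`). The number of environments, the timestep budget and the log directory are placeholder assumptions, not tuned or recommended values.

```python
ENV_ID = "BreakoutNoFrameskip-v4"  # same environment as in the issue description


def train_ppo2_breakout(total_timesteps=int(1e7), num_env=8, log_dir="./ppo2_breakout_tb/"):
    """Train PPO2 on Breakout; all default arguments here are assumptions."""
    # Imports are kept inside the function so the file can be loaded
    # even where stable-baselines is not installed.
    from stable_baselines import PPO2
    from stable_baselines.common.cmd_util import make_atari_env
    from stable_baselines.common.vec_env import VecFrameStack

    # Standard Atari preprocessing wrappers, then stack 4 frames per observation.
    env = make_atari_env(ENV_ID, num_env=num_env, seed=0)
    env = VecFrameStack(env, n_stack=4)

    # verbose=1 and TensorBoard logging, as in the PPO1 run described above.
    model = PPO2("CnnPolicy", env, verbose=1, tensorboard_log=log_dir)
    model.learn(total_timesteps=total_timesteps)
    return model


if __name__ == "__main__":
    train_ppo2_breakout()
```

The zoo command from the first comment should work the same way with `--algo ppo2`, assuming the zoo ships hyperparameters for that algorithm and environment.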


araffin commented Jul 4, 2019

The fix is now merged into master.
