reinforce_py/algorithms/REINFORCE at master · borgwang/reinforce_py

REINFORCE

REINFORCE belongs to the family of policy gradient methods, which parameterize the policy directly rather than a state-value function.
For more details about REINFORCE and other policy gradient algorithms, refer to Chapter 13 of Reinforcement Learning: An Introduction (2nd Edition).
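
To make the idea concrete, here is a minimal NumPy sketch of the Monte Carlo policy gradient estimate that REINFORCE relies on. It is not the code in agent.py: the linear-softmax policy, the function names, and the `gamma` default are assumptions chosen for illustration, and the sketch omits the extra `gamma ** t` weighting that appears in the textbook update, a common simplification.

```python
import numpy as np

def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * r_{t+1} + ... for every step t."""
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = 0.0
    # Walk the episode backwards so each return reuses the next one.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns

def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    exps = np.exp(logits - logits.max())
    return exps / exps.sum()

def reinforce_gradient(theta, states, actions, rewards, gamma=0.99):
    """Estimate grad J(theta) from one episode.

    theta has shape (n_actions, n_features); the (hypothetical) policy is
    pi(a | s) = softmax(theta @ s)[a].
    """
    returns = discounted_returns(rewards, gamma)
    grad = np.zeros_like(theta)
    for s, a, g in zip(states, actions, returns):
        probs = softmax(theta @ s)
        # Gradient of log pi(a | s) for a linear-softmax policy:
        # row a gets (1 - pi(a | s)) * s, every other row b gets -pi(b | s) * s.
        dlog = -np.outer(probs, s)
        dlog[a] += s
        grad += g * dlog
    return grad
```

A gradient ascent step would then be `theta += learning_rate * reinforce_gradient(...)`, with `learning_rate` chosen by the user.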

Here we use REINFORCE to solve the game of Pong.

Pong

Pong is an Atari game in which the player controls one paddle (the other is controlled by a decent AI) and tries to bounce the ball past the opponent. In the reinforcement learning setting, the state is the raw pixels of the screen and the action is moving the paddle UP or DOWN.
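
As a rough illustration of that state and action space, the sketch below turns a raw RGB frame into a flat feature vector and samples UP or DOWN from a probability. The helper names, the grayscale conversion, and the downsampling stride are assumptions made for this example and may differ from what train_REINFORCE.py actually does; the random frame only stands in for a real 210x160 Atari screen.

```python
import numpy as np

def preprocess(frame, stride=2):
    """Turn an RGB frame of shape (H, W, 3) into a flat vector in [0, 1]."""
    gray = frame.astype(np.float64).mean(axis=2)  # average the colour channels
    small = gray[::stride, ::stride]              # keep every stride-th pixel
    return (small / 255.0).ravel()

def sample_action(p_up, rng):
    """Sample UP with probability p_up, otherwise DOWN."""
    return "UP" if rng.random() < p_up else "DOWN"

rng = np.random.default_rng(0)
# Stand-in for an emulator frame; a real one would come from the environment.
frame = rng.integers(0, 256, size=(210, 160, 3), dtype=np.uint8)
state = preprocess(frame)
action = sample_action(0.5, rng)
```

In a full agent, `p_up` would be the output of the policy network evaluated on `state` rather than a fixed constant.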

Requirements

Run

python train_REINFORCE.py
