Skip to content

Latest commit

 

History

History

REINFORCE

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

REINFORCE

REINFORCE belongs to Policy Gradient methods, which directly parameterize the policy rather than a state value function.
For more details about REINFORCE and other policy gradient algorithms, refer Chap 13 of Reinforcement Learning: An Introduction 2nd Edition

Here we use REINFORCE Methods to solve game of Pong.

Pong

The game of Pong is an Atari game which user control one of the paddle (the other one is control by a decent AI) and you have to bounce the ball past the other side. In reinforcement learning setting, the state is raw pixels and the action is moving the paddle UP or DOWN.

pong

Requirements

Run

python train_REINFORCE.py