Skip to content

My attempt to solve the classic CartPole-v0 problem using (Deep) Reinforcement Learning

Notifications You must be signed in to change notification settings

layman-n-ish/Stable-oid

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Stable-oid

Implemented different RL algorithms to solve the infamous CartPole problem.

Algorithms:

  • Q-Learning:

    "Bucket-ised" the continous state space to construct a lookup table, a Q-table, which is used to perform updates as governed by the Bellman Optimality Equation. Check out q_learning_results.txt and the q_learning_plots folder for the write-up (on the complete training process) and plots, for consecutive runs, respectively.

    Insights from Ferdinand and Matthew.

  • Deep Q-Networks (DQN):

    (Coming soon)

To Do:

  • Tuning and update q_learning_results.txt
  • Implement DQN (with experience replay?)

About

My attempt to solve the classic CartPole-v0 problem using (Deep) Reinforcement Learning

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages