Skip to content

Solving the custom cartpole balance problem in gazebo environment using Proximal Policy Optimization(PPO)

Notifications You must be signed in to change notification settings

navuboy/ppo_gazebo_tf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 

Repository files navigation

ppo_gazebo_tf

Reinforcement Learning for solving the custom cartpole balance problem in gazebo environment using Proximal Policy Optimization(PPO).

Environment

  • Custom cartpole environment(from OpenAI gym) in gazebo.
  • Observation Space: 4 (continuos)
  • Action Space: 2 (discrete)
  • Reward range:

Dependencies

File setup:

  • cartpole_gazebo contains the robot model(both .stl files & .urdf file) and also the gazebo launch file.

  • cartpole_controller contains the reinforcement learning implementation of Proximal Policy Optimization(PPO) for custom cartpole environment.

Training Phase:

python3 ppo_train.py

Testing trained policy:

python3 ppo_test.py

References:

TODO:

  • Use Tensorboard for plotting the training and testing graphs.

Project collaborator(s):

Arun Kumar (https://github.com/ioarun)

About

Solving the custom cartpole balance problem in gazebo environment using Proximal Policy Optimization(PPO)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published