Reinforcement learning using PPO on the 3D Ball environment for Unity ML-Agents, with MPI used to distribute training across separate processes.

dogeplusplus/ball-balancer

ball-balancer

  • Demonstration of using PPO to teach a single agent to balance a ball on its head.
  • Example of training Unity ML-Agents through the lower-level Python API, using the PPO implementation from OpenAI's Spinning Up, which uses MPI for parallel training.
  • Adds support for TensorBoard logging and for saving experiments and their configurations.
  • Also contains some legacy code for DDPG and TD3 (WIP).
  • The environment is custom built from the examples provided by Unity and includes only 1 agent instead of 12, due to limitations of the Python wrappers.
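
For readers unfamiliar with PPO, the clipped surrogate objective at its core can be sketched in a few lines. This is a minimal, standard-library-only illustration and not the code used in this repository or in Spinning Up; the function name `ppo_clip_loss` and its parameters are hypothetical, and the default `clip_ratio` of 0.2 is simply a commonly used value.

```python
import math


def ppo_clip_loss(logp_new, logp_old, advantages, clip_ratio=0.2):
    """Return the negative PPO clipped surrogate objective, averaged over a batch.

    Hypothetical helper for illustration only. Inputs are equal-length
    sequences of per-sample log-probabilities and advantage estimates.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(logp_new, logp_old, advantages):
        # Probability ratio between the updated policy and the old policy.
        ratio = math.exp(lp_new - lp_old)
        # Restrict the ratio to [1 - clip_ratio, 1 + clip_ratio].
        clipped = max(1.0 - clip_ratio, min(ratio, 1.0 + clip_ratio))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    # Negate so that minimising the loss maximises the objective.
    return -total / len(advantages)
```

Calling `ppo_clip_loss([0.0], [0.0], [1.0])` leaves the ratio at 1, so nothing is clipped. Once the new policy moves far from the old one, the clipped term caps how much a single update can gain from a positive advantage.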

Ball balancing demonstration
