- Demonstration using PPO to teach a single agent to balance a ball on its head.
- Example of training Unity ML agents using the lower level Python API, using PPO implementation from OpenAI's spinningup implementation which uses MPI for parrallel training.
- Added support for logging in tensorboard, saving experiments and configurations for experiments.
- Also contains some legacy code for DDPG and TD3 (WIP).
- The environment is custom built from the examples given by Unity, to include only 1 agent instead of 12 due to limitations of the python wrappers.
-
Notifications
You must be signed in to change notification settings - Fork 0
dogeplusplus/ball-balancer
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Reinforcement learning using PPO on the 3D Ball environment for Unity ML-Agents. Using MPI to do distributed training across separate processes
Topics
Resources
Stars
Watchers
Forks
Packages 0
No packages published