ball-balancer

Demonstration using PPO to teach a single agent to balance a ball on its head.
Example of training Unity ML agents using the lower level Python API, using PPO implementation from OpenAI's spinningup implementation which uses MPI for parrallel training.
Added support for logging in tensorboard, saving experiments and configurations for experiments.
Also contains some legacy code for DDPG and TD3 (WIP).
The environment is custom built from the examples given by Unity, to include only 1 agent instead of 12 due to limitations of the python wrappers.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
agents		agents
algorithms		algorithms
assets		assets
configs		configs
utils		utils
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Makefile		Makefile
README.md		README.md
firstTimeSetup.sh		firstTimeSetup.sh
requirements.txt		requirements.txt

Provide feedback