Simple single-file implementations of reinforcement learning algorithms in Julia.
Inspired by CleanRL.
- Simple DQN
- A2C
- Rainbow
- PPO
- DDPG
- SAC

- General replay buffer
- CLI for hyperparameters
- Support loggers
- GPU training
- Multi-thread PPO
- Vectorized envs
- Plotting
- Multi-loggers (file/console/TensorBoard)
- Make nn inputs F32 (F32 env wrapper?)
  - done for PPO
- Make individual file runners, e.g. experiments/run_ppo.(jl/sh)
- Better logging: add a log interval and always log at the correct step
- Profile PPO