Skip to content

Deep deterministic policy gradient (DDPG) in PyTorch 🚀

License

Notifications You must be signed in to change notification settings

zhihanyang2022/pytorch-ddpg

Repository files navigation

Deep deterministic policy gradient

A minimalistic and friendly implementation of DDPG using PyTorch.

https://spinningup.openai.com/en/latest/algorithms/ddpg.html

TODO:

  • simplify buffer
  • requirements
  • think out how to easier switch between different variants
  • important features and variants
  • hyperparameters
  • training curves and standard errors
  • action wrapper
  • some design choices and justifications

Credit:

Leadboard performances https://github.com/openai/gym/wiki/Leaderboard#pendulum-v0