Implemenation of DDPG with numpy only (without Tensorflow)
neural-network
policy-gradient
reinforcement-learning-algorithms
stochastic-gradient-descent
adam-optimizer
deep-deterministic-policy-gradient
actor-critic-algorithm
replay-buffer
target-network
-
Updated
Mar 4, 2018 - Python