This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
-
Updated
Jan 16, 2021 - Python
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
PyTorch implementation of Trust Region Policy Optimization
Python implementation of some numerical (optimization) methods
Trust Region Policy Optimization (TRPO) in pure TensorFlow
A collection of Reinforcement Learning implementations with PyTorch
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
Benchmarking the Natural Gradient in Policy Gradient Methods and Evolution Strategies
Course projects of CS395T Numerical Optimization, UT Austin
The pytorch implemetation of trpo
Add a description, image, and links to the trust-region-policy-optimization topic page so that developers can more easily learn about it.
To associate your repository with the trust-region-policy-optimization topic, visit your repo's landing page and select "manage topics."