Example TRPO implementation with ReLAx
-
Updated
Aug 29, 2022 - Jupyter Notebook
Example TRPO implementation with ReLAx
Scheduling TRPO's KL Divergence Constraint
The pytorch implemetation of trpo
Learning to Run NIPS 2017 Competition
Solving the Atari Breakout environment using Stable Baselines
A repository for easy understanding of codes in Deep Reinforcement Learning
Deep Reinforcement Learning Toolbox
ROS 2 enabled Machine Learning algorithms
A pytorch-version implementation of RL algorithms. Now it collects TRPO, ClipPPO, A2C, GAIL and ADCV.
Comparing VPG, TRPO and PPO from Policy Gradient family
My implementation of TRPO
My solutions to the labs from this bootcamp:
Reinforcement learning algorithm implements.
Add a description, image, and links to the trpo topic page so that developers can more easily learn about it.
To associate your repository with the trpo topic, visit your repo's landing page and select "manage topics."