The pytorch implemetation of trpo
-
Updated
Mar 14, 2022 - Python
The pytorch implemetation of trpo
Scheduling TRPO's KL Divergence Constraint
Course projects of CS395T Numerical Optimization, UT Austin
My solutions to the labs from this bootcamp:
Undergraduate Dissertation (University of Malta) 2020-2023 - 'Autonomous Drone Control using Reinforcement Learning''
works about solving nonlinear dynamic systems
Benchmarking the Natural Gradient in Policy Gradient Methods and Evolution Strategies
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
A collection of Reinforcement Learning implementations with PyTorch
Trust Region Policy Optimization (TRPO) in pure TensorFlow
A trading bitcoin agent was created with deep reinforcement learning implementations.
Python implementation of some numerical (optimization) methods
PyTorch implementation of Trust Region Policy Optimization
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Add a description, image, and links to the trust-region-policy-optimization topic page so that developers can more easily learn about it.
To associate your repository with the trust-region-policy-optimization topic, visit your repo's landing page and select "manage topics."