trpo

Here are 74 public repositories matching this topic...

nslyubaykin / relax_trpo_example

Example TRPO implementation with ReLAx

reinforcement-learning gae policy-gradient reinforcement-learning-algorithms continuous-control trpo generalized-advantage-estimation discrete-control

Updated Aug 29, 2022
Jupyter Notebook

isk03276 / trpo-for-jaco2

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

jaco trpo

Updated Nov 24, 2019
Jupyter Notebook

Lolimpo / TODOList-TRPO

Course project for the TRPO (ТРПО) course

coursework trpo sibsutis ip-713

Updated Jun 8, 2023
C

kruglovaliza / LR03

Repository for lab assignment No. 3

trpo

Updated Apr 6, 2020

nslyubaykin / trpo_schedule_kl

Scheduling TRPO's KL Divergence Constraint

reinforcement-learning scheduling policy-gradient reinforcement-learning-algorithms continuous-control trpo kl-divergence trust-region-policy-optimization

Updated Aug 29, 2022
Jupyter Notebook

dodoseung / trpo-trust-region-policy-optimization-pytorch

A PyTorch implementation of TRPO

deep-reinforcement-learning pytorch trpo trust-region-policy-optimization trpo-pytorch

Updated Mar 14, 2022
Python

marioyc / learning-to-run

Learning to Run NIPS 2017 Competition

machine-learning reinforcement-learning tensorflow continuous-control trpo ppo

Updated Aug 18, 2017
Python

rvtsukanov / AI

Project work

artificial-intelligence actor-critic trpo

Updated Mar 9, 2020
Python

SwamiKannan / Breakout-v0-using-Stable-Baselines

Solving the Atari Breakout environment using Stable Baselines

dqn tensorboard-visualizations trpo ppo pytorch-implementation stable-baselines a2c-algorithm recurrent-ppo qrdqn sb3-contrib

Updated Oct 25, 2022
Jupyter Notebook

sprakashdash / RL.Fun.Do

A repository for easily understanding Deep Reinforcement Learning code

reinforcement-learning deep-reinforcement-learning pytorch grid-world reinforcement-learning-algorithms ddpg sac trpo ddpg-algorithm ppo a2c soft-actor-critic ddpg-pytorch trpo-pytorch ppo-pytorch openai-blog

Updated Apr 9, 2021
Python

Suman7495 / Deep-Reinforcement-Learning

Deep Reinforcement Learning Toolbox

machine-learning reinforcement-learning qlearning ai deep-learning deep-reinforcement-learning artificial-intelligence dqn reinforcement-learning-algorithms rl deep-q-network ddpg trpo

Updated Apr 14, 2019
Python

YueErro / ros2learn

ROS 2 enabled Machine Learning algorithms

machine-learning reinforcement-learning deep-learning robotics ml dqn rl ros2 trpo ppo acktr

Updated Jun 26, 2019
Python

SunshineJunFu / Pytorch-RL

RL & Pytorch

pytorch dqn rl pg a3c ddpg trpo ppo d3pg

Updated Jun 28, 2019
Python

lx10077 / rlpy

A PyTorch implementation of RL algorithms. It currently includes TRPO, ClipPPO, A2C, GAIL and ADCV.

reinforcement-learning pytorch trpo ppo a2c control-variates mujoco-environments

Updated Oct 27, 2019
Python

songlei00 / DeepReinforcementLearningToys

PyTorch Deep Reinforcement Learning.

deep-reinforcement-learning pytorch ddpg sac drl trpo a2c td3

Updated Mar 17, 2021
Python

alirezakazemipour / TRPO-PyTorch

Trust Region Policy Optimization in PyTorch.

trpo

Updated Nov 28, 2022
Python

a13xe / PolicyGradientAlgorithms

Comparing VPG, TRPO and PPO from Policy Gradient family

visualization python reinforcement-learning deep-learning tensorflow policy-gradient cartpole mountain-car plotting acrobat trpo ppo vpg

Updated Jun 13, 2023
Python

rudrasohan / Trust-Region-Policy-Optimization

My implementation of TRPO

reinforcement-learning pytorch learning-by-doing trpo

Updated Nov 16, 2020
Python

waynemystir / deep-RL-bootcamp

My solutions to the labs from this bootcamp:

reinforcement-learning deep-reinforcement-learning q-learning policy-gradient trpo trust-region-policy-optimization natural-policy-gradient

Updated Mar 22, 2019
Jupyter Notebook

hischen / reinforcement-learning

Implementations of reinforcement learning algorithms.

reinforcement-learning qlearning monte-carlo deep-reinforcement-learning policy-gradient reinforcement-learning-algorithms multi-armed-bandit sac actor-critic trpo bandit ddpg-algorithm proximal-policy-optimization prioritized-experience-replay deepq-learning sarsa-algorithm

Updated Apr 5, 2023
Jupyter Notebook
