Undergraduate Dissertation (University of Malta) 2020-2023 - 'Autonomous Drone Control using Reinforcement Learning''
-
Updated
Dec 4, 2023 - Jupyter Notebook
Undergraduate Dissertation (University of Malta) 2020-2023 - 'Autonomous Drone Control using Reinforcement Learning''
Scheduling TRPO's KL Divergence Constraint
A trading bitcoin agent was created with deep reinforcement learning implementations.
A collection of Reinforcement Learning implementations with PyTorch
The pytorch implemetation of trpo
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
Python implementation of some numerical (optimization) methods
Benchmarking the Natural Gradient in Policy Gradient Methods and Evolution Strategies
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
My solutions to the labs from this bootcamp:
PyTorch implementation of Trust Region Policy Optimization
Trust Region Policy Optimization (TRPO) in pure TensorFlow
works about solving nonlinear dynamic systems
Course projects of CS395T Numerical Optimization, UT Austin
Add a description, image, and links to the trust-region-policy-optimization topic page so that developers can more easily learn about it.
To associate your repository with the trust-region-policy-optimization topic, visit your repo's landing page and select "manage topics."