Skip to content

A collection of Reinforcement Learning implementations with PyTorch

License

Notifications You must be signed in to change notification settings

hcnoh/rl-collection-pytorch

Repository files navigation

Reinforcement Learning Collection with PyTorch

This repository is a collection of reinforcement learning algorithms: Policy-Gradient, Actor-Critic, Trust Region Policy Optimization, and Generalized Advantage Estimation. (More algorithms will be added soon...)

In this repository, OpenAI Gym environments such as CartPole-v0, Pendulum-v0, and BipedalWalker-v3 are used. You need to install them before running this repository.

The results of CartPole environment

The results of Pendulum environment

The results of BipedalWalker environment

Future Works

  • Find the errors of the Actor-Critic
  • Implement PPO
  • Search other environments to running the algorithms

References

  • An explaination of TRPO line search: link