
Udacity-Deep-Reinforcement-Learning-p2-continuous-learning

DDPG implementation for continuous action space

The Problem Description

The Environment: In this environment, a double-jointed arm can move to target locations. The environment is based on Unity ML-Agents.

Note: The Unity ML-Agents team frequently releases updated versions of their environment. This project uses the v0.4 interface. The project environment provided by Udacity is similar to, but not identical to, the Reacher environment on the Unity ML-Agents GitHub page.

For this project, Udacity provides two separate versions of the Unity environment:

  • The first version contains a single agent.
  • The second version contains 20 identical agents, each with its own copy of the environment.

The observation space: The observation space consists of 33 variables corresponding to position, rotation, velocity, and angular velocities of the arm.

The action space: Each action is a vector with four numbers, corresponding to torque applicable to two joints. Every entry in the action vector should be a number between -1 and 1.

Rewards: A reward of +0.1 is provided for each step that the agent's hand is in the goal location.

Task (Episodic/Continuous): The task is continuous, so a cap on the maximum number of timesteps is used for each episode.
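
To make the interface concrete, here is a minimal random-action loop for the 20-agent version. This is a sketch assuming the unityagents v0.4 package and a hypothetical path to the environment binary (see Getting started):

```python
import numpy as np
from unityagents import UnityEnvironment

env = UnityEnvironment(file_name="env/Reacher.app")  # hypothetical path; see Getting started
brain_name = env.brain_names[0]

env_info = env.reset(train_mode=True)[brain_name]
states = env_info.vector_observations        # shape (20, 33): one 33-dim observation per agent
num_agents, action_size = states.shape[0], 4

scores = np.zeros(num_agents)
for t in range(1000):                        # cap on timesteps per episode
    actions = np.clip(np.random.randn(num_agents, action_size), -1, 1)  # entries in [-1, 1]
    env_info = env.step(actions)[brain_name]
    scores += env_info.rewards               # +0.1 per step the hand is in the goal location
    states = env_info.vector_observations
    if np.any(env_info.local_done):
        break

print(f"Mean score over agents: {scores.mean():.2f}")
env.close()
```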

Solution: I have decided to go with the second version. The agents therefore need to be trained until they achieve an average score of +30 (over 100 consecutive episodes, averaged over all agents).
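
As a point of reference, the solving criterion can be tracked with a 100-episode rolling window; the sketch below assumes agent_scores holds the 20 per-agent returns for one episode:

```python
from collections import deque
import numpy as np

scores_window = deque(maxlen=100)  # scores of the last 100 episodes

def record_episode(agent_scores):
    """agent_scores: the per-agent returns collected in one episode."""
    scores_window.append(np.mean(agent_scores))  # average over all 20 agents
    # Solved once the window is full and its mean reaches +30.
    return len(scores_window) == 100 and np.mean(scores_window) >= 30.0
```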

Getting started

Installation requirements

  • To begin with, you need to configure a Python 3.6 / PyTorch 0.4.0 environment with the requirements described in the Udacity repository.

  • Then you need to clone this project and have it accessible in your Python environment.

  • For this project, you will not need to install Unity. You only need to download the environment build that matches your operating system.

  • Finally, you can unzip the environment archive into the project's environment directory and set the path to the UnityEnvironment in the code, as sketched below.
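
For example (paths are illustrative; the binary name depends on your operating system and the archive you downloaded):

```python
from unityagents import UnityEnvironment

# Hypothetical locations after unzipping into an env/ directory:
#   Linux:   env/Reacher_Linux/Reacher.x86_64
#   Mac OSX: env/Reacher.app
#   Windows: env/Reacher_Windows_x86_64/Reacher.exe
env = UnityEnvironment(file_name="env/Reacher.app")
```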

Instructions

The configuration for the environment, the agent, and the DDPG parameters is specified in the config file.
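
The keys below are an illustrative sketch of the kind of settings such a config file typically holds for DDPG; they are not the repository's actual names or values:

```python
# Illustrative only -- see the repository's config file for the real keys and values.
config = {
    "env_path": "env/Reacher.app",   # hypothetical path to the Unity environment
    "buffer_size": int(1e6),         # replay buffer capacity
    "batch_size": 128,               # minibatch size sampled from the buffer
    "gamma": 0.99,                   # discount factor
    "tau": 1e-3,                     # soft-update rate for the target networks
    "lr_actor": 1e-4,                # actor learning rate
    "lr_critic": 1e-3,               # critic learning rate
    "max_t": 1000,                   # max timesteps per episode
    "n_episodes": 300,               # training episode budget
}
```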
