Skip to content
/ tennis Public

Collaboration and competition project for the Udacity deep RL nanodegree

Notifications You must be signed in to change notification settings

odellus/tennis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tennis

Effort to train an agent (or agents) to play tennis collaboratively.

ENVIRONMENT

Some important information about the tennis environment from Unity's ML-Agents: the size of the action space is two. The size of the naive state space is 8, but we use three consecutive states as one so the dimension of the state space used to map states onto actions is 24. The environment is considered solved when an average score of 0.5 over the last 100 episodes is reached.

SETUP

If you don't wish to use MongoDB to persist experimental results be sure you have Persist_mongodb set to False in the configurations, and just run bash setup.sh to install.

If you wish to use MongoDB to persist the results of the experiments, please install MongoDB, then run in the terminal

# Install dependencies
python3 -m pip install --user torch matplotlib numpy pymongo
# Get Tennis environment
curl -o Tennis_Linux_NoVis.zip https://s3-us-west-1.amazonaws.com/udacity-drlnd/P3/Tennis/Tennis_Linux_NoVis.zip
unzip -d . Tennis_Linux_NoVis.unzip
# Get the python API
git clone -b 0.4.0b https://github.com/Unity-Technologies/ml-agents.git ./ml-agents

RUN

To train a DDPG network open a terminal and enter

python3 ddpg.py

Good luck and happy hunting!

About

Collaboration and competition project for the Udacity deep RL nanodegree

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published