Taxi-v1---OpenAI-Gym-Reinforcement-Deep_Q_Learning-

This task was introduced in [Dietterich2000] to illustrate some issues in hierarchical reinforcement learning. There are 4 locations (labeled by different letters) and your job is to pick up the passenger at one location and drop him off in another. You receive +20 points for a successful dropoff, and lose 1 point for every timestep it takes. There is also a 10 point penalty for illegal pick-up and drop-off actions.

Episode 20000/20000 || Best average reward 9.298

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
__pycache__		__pycache__
Agent.py		Agent.py
Monitor.py		Monitor.py
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Taxi-v1---OpenAI-Gym-Reinforcement-Deep_Q_Learning-

About

Releases

Packages

Languages

hk3427/Taxi-v1---OpenAI-Gym

Folders and files

Latest commit

History

Repository files navigation

Taxi-v1---OpenAI-Gym-Reinforcement-Deep_Q_Learning-

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages