Introduction

To solve the partially observed LunarLander-v2 environment from OpenAI Gym, we tried four different methods:

1. SARSA
2. Q-Learning
3. DQN
4. DQN bonus
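
For readers unfamiliar with the first two methods, the sketch below contrasts the textbook SARSA and Q-Learning update rules. It is not taken from this repository: the table layout, the helper names (`make_q_table`, `sarsa_update`, `q_learning_update`) and the values of `ALPHA` and `GAMMA` are assumptions for illustration, and it presumes the continuous LunarLander state has already been discretized into a hashable key.

```python
from collections import defaultdict

ALPHA = 0.1    # learning rate (assumed value, not from the repository)
GAMMA = 0.99   # discount factor (assumed value, not from the repository)
N_ACTIONS = 4  # LunarLander-v2 has four discrete actions


def make_q_table():
    # Hypothetical helper: maps a discretized, hashable state
    # to a list holding one value per action.
    return defaultdict(lambda: [0.0] * N_ACTIONS)


def sarsa_update(q, s, a, r, s_next, a_next, done):
    # On-policy target: uses the action actually chosen in the next state.
    target = r if done else r + GAMMA * q[s_next][a_next]
    q[s][a] += ALPHA * (target - q[s][a])


def q_learning_update(q, s, a, r, s_next, done):
    # Off-policy target: uses the greedy (maximum) value in the next state.
    target = r if done else r + GAMMA * max(q[s_next])
    q[s][a] += ALPHA * (target - q[s][a])
```

The only difference is the bootstrap term: SARSA follows the behaviour policy's next action, while Q-Learning always bootstraps from the best next action.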

We tested their performance in two settings: a blind area of width 0.2 and a blind area of width 0.4.
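
One possible way to simulate such a blind area is sketched below. This is an assumption, not the repository's definition: it treats the blind area as a band of the given total width centred on the horizontal coordinate `observation[0]`, and hides the whole 8-dimensional observation with zeros while the lander is inside the band. The function name `apply_blind_area` and the `center` parameter are hypothetical.

```python
def apply_blind_area(observation, width, center=0.0):
    """Return a copy of `observation`, zeroed out inside the blind area.

    Assumption: the blind area is a band of total `width` around `center`
    on the horizontal coordinate observation[0]. The notebook in this
    repository may define it differently.
    """
    x = observation[0]
    if abs(x - center) <= width / 2:
        # Inside the band: the agent receives no information.
        return [0.0] * len(observation)
    # Outside the band: the observation passes through unchanged.
    return list(observation)


# Example: x = 0.15 is visible with width 0.2 but hidden with width 0.4.
obs = [0.15, 1.0, 0.0, -0.5, 0.0, 0.0, 0.0, 0.0]
visible = apply_blind_area(obs, 0.2)
hidden = apply_blind_area(obs, 0.4)
```

Under this assumption, a wider band leaves the agent without observations for a larger part of the descent, which is what makes the 0.4 setting harder than the 0.2 one.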

Install

You may encounter some problems installing the LunarLander-v2 environment. This link could be helpful.
