RL-path-planning

Path planning using reinforcement learning in a n x n grid environment:

Starting position at (0, 0) (top left), and goal position at (n-1, n-1) (bottom right).

Current techniques

Monte-Carlo control without exploring starts
SARSA with an $\epsilon$-greedy behavior policy
Q-learning with an $\epsilon$-greedy behavior policy

Monte-Carlo control without exploring starts

SARSA with an $\epsilon$-greedy behavior policy

Update rule:

$$Q\left(S_t,A_t\right)← Q\left(S_t,A_t\right)+\alpha\left[R_{t+1}+\gamma Q\left(S_{t+1}, A_{t+1}\right)-Q\left(S_t,A_t\right)\right]$$

Q-learning with an $\epsilon$-greedy behavior policy

Update rule:

$$Q\left(S_t,A_t\right)← Q\left(S_t,A_t\right)+\alpha\left[R_{t+1}+\gamma \text{max}_ {a'}Q\left(S_{t+1}, a'\right)-Q\left(S_t,A_t\right)\right]$$

$\epsilon$-greedy policy

$$\pi\left(a|s\right) \begin{cases} 1-\epsilon+\frac{\epsilon}{\left|A\left(s\right)\right|}, & \text{if}\ a=A^{*}≜\text{argmax}_{a}Q\left(s,a\right) \\ \frac{\epsilon}{\left|A\left(s\right)\right|}, & \text{if}\ a\neq A^{*} \end{cases}$$

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
img		img
.gitignore		.gitignore
Monte_Carlo_without_es.py		Monte_Carlo_without_es.py
Q_learning.py		Q_learning.py
README.md		README.md
RL.py		RL.py
Sarsa.py		Sarsa.py
project 1.ipynb		project 1.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL-path-planning

Monte-Carlo control without exploring starts

SARSA with an $\epsilon$-greedy behavior policy

Q-learning with an $\epsilon$-greedy behavior policy

$\epsilon$-greedy policy

About

Releases

Packages

Languages

pngqunshen/RL-path-planning

Folders and files

Latest commit

History

Repository files navigation

RL-path-planning

Monte-Carlo control without exploring starts

SARSA with an $\epsilon$-greedy behavior policy

Q-learning with an $\epsilon$-greedy behavior policy

$\epsilon$-greedy policy

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages