FeUdal Networks for Hierarchical Reinforcement Learning

Vezhnevets, Alexander Sasha; Osindero, Simon; Schaul, Tom; Heess, Nicolas; Jaderberg, Max; Silver, David; Kavukcuoglu, Koray

Computer Science > Artificial Intelligence

arXiv:1703.01161 (cs)

[Submitted on 3 Mar 2017 (v1), last revised 6 Mar 2017 (this version, v2)]

Title:FeUdal Networks for Hierarchical Reinforcement Learning

Authors:Alexander Sasha Vezhnevets, Simon Osindero, Tom Schaul, Nicolas Heess, Max Jaderberg, David Silver, Koray Kavukcuoglu

View PDF

Abstract:We introduce FeUdal Networks (FuNs): a novel architecture for hierarchical reinforcement learning. Our approach is inspired by the feudal reinforcement learning proposal of Dayan and Hinton, and gains power and efficacy by decoupling end-to-end learning across multiple levels -- allowing it to utilise different resolutions of time. Our framework employs a Manager module and a Worker module. The Manager operates at a lower temporal resolution and sets abstract goals which are conveyed to and enacted by the Worker. The Worker generates primitive actions at every tick of the environment. The decoupled structure of FuN conveys several benefits -- in addition to facilitating very long timescale credit assignment it also encourages the emergence of sub-policies associated with different goals set by the Manager. These properties allow FuN to dramatically outperform a strong baseline agent on tasks that involve long-term credit assignment or memorisation. We demonstrate the performance of our proposed system on a range of tasks from the ATARI suite and also from a 3D DeepMind Lab environment.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1703.01161 [cs.AI]
	(or arXiv:1703.01161v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1703.01161

Submission history

From: Alexander Vezhnevets [view email]
[v1] Fri, 3 Mar 2017 14:05:11 UTC (1,754 KB)
[v2] Mon, 6 Mar 2017 18:17:18 UTC (1,748 KB)

Computer Science > Artificial Intelligence

Title:FeUdal Networks for Hierarchical Reinforcement Learning

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:FeUdal Networks for Hierarchical Reinforcement Learning

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators