Sample Efficient Actor-Critic with Experience Replay

Wang, Ziyu; Bapst, Victor; Heess, Nicolas; Mnih, Volodymyr; Munos, Remi; Kavukcuoglu, Koray; de Freitas, Nando

Computer Science > Machine Learning

arXiv:1611.01224 (cs)

[Submitted on 3 Nov 2016 (v1), last revised 10 Jul 2017 (this version, v2)]

Title:Sample Efficient Actor-Critic with Experience Replay

Authors:Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Remi Munos, Koray Kavukcuoglu, Nando de Freitas

View PDF

Abstract:This paper presents an actor-critic deep reinforcement learning agent with experience replay that is stable, sample efficient, and performs remarkably well on challenging environments, including the discrete 57-game Atari domain and several continuous control problems. To achieve this, the paper introduces several innovations, including truncated importance sampling with bias correction, stochastic dueling network architectures, and a new trust region policy optimization method.

Comments:	20 pages. Prepared for ICLR 2017
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1611.01224 [cs.LG]
	(or arXiv:1611.01224v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1611.01224

Submission history

From: Ziyu Wang [view email]
[v1] Thu, 3 Nov 2016 23:21:32 UTC (1,409 KB)
[v2] Mon, 10 Jul 2017 14:38:10 UTC (2,708 KB)

Computer Science > Machine Learning

Title:Sample Efficient Actor-Critic with Experience Replay

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Sample Efficient Actor-Critic with Experience Replay

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators