Continuous control with deep reinforcement learning

Lillicrap, Timothy P.; Hunt, Jonathan J.; Pritzel, Alexander; Heess, Nicolas; Erez, Tom; Tassa, Yuval; Silver, David; Wierstra, Daan

Computer Science > Machine Learning

arXiv:1509.02971 (cs)

[Submitted on 9 Sep 2015 (v1), last revised 5 Jul 2019 (this version, v6)]

Title:Continuous control with deep reinforcement learning

Authors:Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra

View PDF

Abstract:We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic problems such as cartpole swing-up, dexterous manipulation, legged locomotion and car driving. Our algorithm is able to find policies whose performance is competitive with those found by a planning algorithm with full access to the dynamics of the domain and its derivatives. We further demonstrate that for many of the tasks the algorithm can learn policies end-to-end: directly from raw pixel inputs.

Comments:	10 pages + supplementary
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1509.02971 [cs.LG]
	(or arXiv:1509.02971v6 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1509.02971

Submission history

From: Jonathan Hunt [view email]
[v1] Wed, 9 Sep 2015 23:01:36 UTC (344 KB)
[v2] Wed, 18 Nov 2015 17:34:41 UTC (338 KB)
[v3] Thu, 7 Jan 2016 19:09:07 UTC (338 KB)
[v4] Tue, 19 Jan 2016 20:30:47 UTC (339 KB)
[v5] Mon, 29 Feb 2016 18:45:53 UTC (339 KB)
[v6] Fri, 5 Jul 2019 10:47:27 UTC (339 KB)

Computer Science > Machine Learning

Title:Continuous control with deep reinforcement learning

Submission history

Access Paper:

References & Citations

15 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Continuous control with deep reinforcement learning

Submission history

Access Paper:

References & Citations

15 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators