When to use parametric models in reinforcement learning?

van Hasselt, Hado; Hessel, Matteo; Aslanides, John

Computer Science > Machine Learning

arXiv:1906.05243 (cs)

[Submitted on 12 Jun 2019]

Title:When to use parametric models in reinforcement learning?

Authors:Hado van Hasselt, Matteo Hessel, John Aslanides

View PDF

Abstract:We examine the question of when and how parametric models are most useful in reinforcement learning. In particular, we look at commonalities and differences between parametric models and experience replay. Replay-based learning algorithms share important traits with model-based approaches, including the ability to plan: to use more computation without additional data to improve predictions and behaviour. We discuss when to expect benefits from either approach, and interpret prior work in this context. We hypothesise that, under suitable conditions, replay-based algorithms should be competitive to or better than model-based algorithms if the model is used only to generate fictional transitions from observed states for an update rule that is otherwise model-free. We validated this hypothesis on Atari 2600 video games. The replay-based algorithm attained state-of-the-art data efficiency, improving over prior results with parametric models.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1906.05243 [cs.LG]
	(or arXiv:1906.05243v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.05243
Journal reference:	NeurIPS 2019

Submission history

From: Hado van Hasselt [view email]
[v1] Wed, 12 Jun 2019 16:57:00 UTC (142 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-06

Change to browse by:

cs
cs.AI
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Hado van Hasselt
Matteo Hessel
John Aslanides

export BibTeX citation

Computer Science > Machine Learning

Title:When to use parametric models in reinforcement learning?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:When to use parametric models in reinforcement learning?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators