REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

Tucker, George; Mnih, Andriy; Maddison, Chris J.; Lawson, Dieterich; Sohl-Dickstein, Jascha

Computer Science > Machine Learning

arXiv:1703.07370v4 (cs)

[Submitted on 21 Mar 2017 (v1), last revised 6 Nov 2017 (this version, v4)]

Title:REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

Authors:George Tucker, Andriy Mnih, Chris J. Maddison, Dieterich Lawson, Jascha Sohl-Dickstein

View PDF

Abstract:Learning in models with discrete latent variables is challenging due to high variance gradient estimators. Generally, approaches have relied on control variates to reduce the variance of the REINFORCE estimator. Recent work (Jang et al. 2016, Maddison et al. 2016) has taken a different approach, introducing a continuous relaxation of discrete variables to produce low-variance, but biased, gradient estimates. In this work, we combine the two approaches through a novel control variate that produces low-variance, \emph{unbiased} gradient estimates. Then, we introduce a modification to the continuous relaxation and show that the tightness of the relaxation can be adapted online, removing it as a hyperparameter. We show state-of-the-art variance reduction on several benchmark generative modeling tasks, generally leading to faster convergence to a better final log-likelihood.

Comments:	NIPS 2017
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1703.07370 [cs.LG]
	(or arXiv:1703.07370v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1703.07370

Submission history

From: George Tucker [view email]
[v1] Tue, 21 Mar 2017 18:05:31 UTC (598 KB)
[v2] Sat, 22 Apr 2017 11:04:12 UTC (1,238 KB)
[v3] Thu, 8 Jun 2017 20:54:49 UTC (6,080 KB)
[v4] Mon, 6 Nov 2017 17:50:34 UTC (6,099 KB)

Computer Science > Machine Learning

Title:REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators