Using Pre-Training Can Improve Model Robustness and Uncertainty

Hendrycks, Dan; Lee, Kimin; Mazeika, Mantas

Computer Science > Machine Learning

arXiv:1901.09960v3 (cs)

[Submitted on 28 Jan 2019 (v1), revised 19 Jun 2019 (this version, v3), latest version 20 Oct 2019 (v5)]

Title:Using Pre-Training Can Improve Model Robustness and Uncertainty

Authors:Dan Hendrycks, Kimin Lee, Mantas Mazeika

View PDF

Abstract:He et al. (2018) have called into question the utility of pre-training by showing that training from scratch can often yield similar performance to pre-training. We show that although pre-training may not improve performance on traditional classification metrics, it improves model robustness and uncertainty estimates. Through extensive experiments on adversarial examples, label corruption, class imbalance, out-of-distribution detection, and confidence calibration, we demonstrate large gains from pre-training and complementary effects with task-specific methods. We introduce adversarial pre-training and show approximately a 10% absolute improvement over the previous state-of-the-art in adversarial robustness. In some cases, using pre-training without task-specific methods also surpasses the state-of-the-art, highlighting the need for pre-training when evaluating future methods on robustness and uncertainty tasks.

Comments:	ICML 2019. PyTorch code here: this https URL
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1901.09960 [cs.LG]
	(or arXiv:1901.09960v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1901.09960

Submission history

From: Dan Hendrycks [view email]
[v1] Mon, 28 Jan 2019 19:37:07 UTC (159 KB)
[v2] Tue, 14 May 2019 05:52:57 UTC (7,437 KB)
[v3] Wed, 19 Jun 2019 16:37:36 UTC (7,437 KB)
[v4] Fri, 21 Jun 2019 17:14:48 UTC (7,081 KB)
[v5] Sun, 20 Oct 2019 20:09:20 UTC (7,094 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-01

Change to browse by:

cs
cs.CV
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Dan Hendrycks
Kimin Lee
Mantas Mazeika

export BibTeX citation

Computer Science > Machine Learning

Title:Using Pre-Training Can Improve Model Robustness and Uncertainty

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Using Pre-Training Can Improve Model Robustness and Uncertainty

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators