Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks

Xu, Weilin; Evans, David; Qi, Yanjun

doi:10.14722/ndss.2018.23198

Computer Science > Computer Vision and Pattern Recognition

arXiv:1704.01155 (cs)

[Submitted on 4 Apr 2017 (v1), last revised 5 Dec 2017 (this version, v2)]

Title:Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks

Authors:Weilin Xu, David Evans, Yanjun Qi

View PDF

Abstract:Although deep neural networks (DNNs) have achieved great success in many tasks, they can often be fooled by \emph{adversarial examples} that are generated by adding small but purposeful distortions to natural examples. Previous studies to defend against adversarial examples mostly focused on refining the DNN models, but have either shown limited success or required expensive computation. We propose a new strategy, \emph{feature squeezing}, that can be used to harden DNN models by detecting adversarial examples. Feature squeezing reduces the search space available to an adversary by coalescing samples that correspond to many different feature vectors in the original space into a single sample. By comparing a DNN model's prediction on the original input with that on squeezed inputs, feature squeezing detects adversarial examples with high accuracy and few false positives. This paper explores two feature squeezing methods: reducing the color bit depth of each pixel and spatial smoothing. These simple strategies are inexpensive and complementary to other defenses, and can be combined in a joint detection framework to achieve high detection rates against state-of-the-art attacks.

Comments:	To appear in Network and Distributed Systems Security Symposium (NDSS) 2018
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:1704.01155 [cs.CV]
	(or arXiv:1704.01155v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1704.01155
Related DOI:	https://doi.org/10.14722/ndss.2018.23198

Submission history

From: Weilin Xu [view email]
[v1] Tue, 4 Apr 2017 18:56:53 UTC (763 KB)
[v2] Tue, 5 Dec 2017 23:45:08 UTC (787 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators