VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization

Mishra, Pankaj; Verk, Riccardo; Fornasier, Daniele; Piciarelli, Claudio; Foresti, Gian Luca

doi:10.1109/ISIE45552.2021.9576231


arXiv:2104.10036 (cs)

[Submitted on 20 Apr 2021]


Abstract: We present a transformer-based network for image anomaly detection and localization. The proposed model combines a reconstruction-based approach with patch embedding. The transformer preserves the spatial information of the embedded patches, which are then processed by a Gaussian mixture density network to localize the anomalous areas. We also publish BTAD, a real-world industrial anomaly dataset. We compare our results with those of other state-of-the-art algorithms on publicly available datasets such as MNIST and MVTec.
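The localization idea in the abstract (score each patch under a Gaussian mixture while keeping its grid position) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes raw pixel patches as a stand-in for the transformer-encoded patch features, and it assumes hand-fixed diagonal mixture parameters rather than ones predicted by a network. The function names `split_into_patches`, `mixture_nll`, and `anomaly_map` are hypothetical.

```python
import math


def split_into_patches(image, patch):
    """Split a 2D list (H x W) into non-overlapping patch x patch blocks.

    Returns a grid (list of rows) of flattened patches, so each patch keeps
    its spatial position in the image.
    """
    h, w = len(image), len(image[0])
    assert h % patch == 0 and w % patch == 0, "image must tile evenly"
    grid = []
    for top in range(0, h, patch):
        row = []
        for left in range(0, w, patch):
            row.append([image[y][x]
                        for y in range(top, top + patch)
                        for x in range(left, left + patch)])
        grid.append(row)
    return grid


def mixture_nll(x, weights, means, variances):
    """Negative log-likelihood of vector x under a diagonal Gaussian mixture."""
    log_terms = []
    for pi, mu, var in zip(weights, means, variances):
        log_p = math.log(pi)
        for xi, mi, vi in zip(x, mu, var):
            log_p += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        log_terms.append(log_p)
    m = max(log_terms)  # log-sum-exp trick for numerical stability
    return -(m + math.log(sum(math.exp(t - m) for t in log_terms)))


def anomaly_map(image, patch, weights, means, variances):
    """Per-patch anomaly scores; a higher score means a less likely patch."""
    grid = split_into_patches(image, patch)
    return [[mixture_nll(p, weights, means, variances) for p in row]
            for row in grid]


# Assumed toy data: a 4x4 image whose bottom-right 2x2 patch deviates.
image = [[0.0, 0.0, 0.0, 0.0],
         [0.0, 0.0, 0.0, 0.0],
         [0.0, 0.0, 5.0, 5.0],
         [0.0, 0.0, 5.0, 5.0]]
# Assumed mixture: two components centred near "normal" (zero) patches.
weights = [0.5, 0.5]
means = [[0.0] * 4, [0.1] * 4]
variances = [[1.0] * 4, [1.0] * 4]
scores = anomaly_map(image, 2, weights, means, variances)
```

In this toy setup the bottom-right entry of `scores` is the largest, so thresholding the map would flag that patch. In a trained model the mixture parameters would come from the network, not from constants.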

Comments:	6 Pages, 4 images, conference published paper
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Report number:	KD-003638
Cite as:	arXiv:2104.10036 [cs.CV]
	(or arXiv:2104.10036v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2104.10036
Journal reference:	IEEE 30th International Symposium on Industrial Electronics (ISIE), 2021
Related DOI:	https://doi.org/10.1109/ISIE45552.2021.9576231

Submission history

From: Pankaj Mishra
[v1] Tue, 20 Apr 2021 15:12:30 UTC (10,304 KB)
