TransReID: Transformer-based Object Re-Identification

He, Shuting; Luo, Hao; Wang, Pichao; Wang, Fan; Li, Hao; Jiang, Wei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2102.04378 (cs)

[Submitted on 8 Feb 2021 (v1), last revised 26 Mar 2021 (this version, v2)]

Title:TransReID: Transformer-based Object Re-Identification

Authors:Shuting He, Hao Luo, Pichao Wang, Fan Wang, Hao Li, Wei Jiang

View PDF

Abstract:Extracting robust feature representation is one of the key challenges in object re-identification (ReID). Although convolution neural network (CNN)-based methods have achieved great success, they only process one local neighborhood at a time and suffer from information loss on details caused by convolution and downsampling operators (e.g. pooling and strided convolution). To overcome these limitations, we propose a pure transformer-based object ReID framework named TransReID. Specifically, we first encode an image as a sequence of patches and build a transformer-based strong baseline with a few critical improvements, which achieves competitive results on several ReID benchmarks with CNN-based methods. To further enhance the robust feature learning in the context of transformers, two novel modules are carefully designed. (i) The jigsaw patch module (JPM) is proposed to rearrange the patch embeddings via shift and patch shuffle operations which generates robust features with improved discrimination ability and more diversified coverage. (ii) The side information embeddings (SIE) is introduced to mitigate feature bias towards camera/view variations by plugging in learnable embeddings to incorporate these non-visual clues. To the best of our knowledge, this is the first work to adopt a pure transformer for ReID research. Experimental results of TransReID are superior promising, which achieve state-of-the-art performance on both person and vehicle ReID benchmarks.

Comments:	Code is available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2102.04378 [cs.CV]
	(or arXiv:2102.04378v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2102.04378

Submission history

From: Shuting He [view email]
[v1] Mon, 8 Feb 2021 17:33:59 UTC (1,509 KB)
[v2] Fri, 26 Mar 2021 15:40:42 UTC (1,288 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:TransReID: Transformer-based Object Re-Identification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:TransReID: Transformer-based Object Re-Identification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators