Do we still need ImageNet pre-training in remote sensing scene classification?

Risojević, Vladimir; Stojnić, Vladan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2111.03690 (cs)

[Submitted on 5 Nov 2021 (v1), last revised 25 May 2022 (this version, v3)]

Title:Do we still need ImageNet pre-training in remote sensing scene classification?

Authors:Vladimir Risojević, Vladan Stojnić

View PDF

Abstract:Due to the scarcity of labeled data, using supervised models pre-trained on ImageNet is a de facto standard in remote sensing scene classification. Recently, the availability of larger high resolution remote sensing (HRRS) image datasets and progress in self-supervised learning have brought up the questions of whether supervised ImageNet pre-training is still necessary for remote sensing scene classification and would supervised pre-training on HRRS image datasets or self-supervised pre-training on ImageNet achieve better results on target remote sensing scene classification tasks. To answer these questions, in this paper we both train models from scratch and fine-tune supervised and self-supervised ImageNet models on several HRRS image datasets. We also evaluate the transferability of learned representations to HRRS scene classification tasks and show that self-supervised pre-training outperforms the supervised one, while the performance of HRRS pre-training is similar to self-supervised pre-training or slightly lower. Finally, we propose using an ImageNet pre-trained model combined with a second round of pre-training using in-domain HRRS images, i.e. domain-adaptive pre-training. The experimental results show that domain-adaptive pre-training results in models that achieve state-of-the-art results on HRRS scene classification benchmarks. The source code and pre-trained models are available at \url{this https URL}.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2111.03690 [cs.CV]
	(or arXiv:2111.03690v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2111.03690

Submission history

From: Vladimir Risojević [view email]
[v1] Fri, 5 Nov 2021 18:30:54 UTC (342 KB)
[v2] Fri, 17 Dec 2021 14:07:21 UTC (342 KB)
[v3] Wed, 25 May 2022 16:21:35 UTC (23 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Do we still need ImageNet pre-training in remote sensing scene classification?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Do we still need ImageNet pre-training in remote sensing scene classification?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators