Big Self-Supervised Models Advance Medical Image Classification

Azizi, Shekoofeh; Mustafa, Basil; Ryan, Fiona; Beaver, Zachary; Freyberg, Jan; Deaton, Jonathan; Loh, Aaron; Karthikesalingam, Alan; Kornblith, Simon; Chen, Ting; Natarajan, Vivek; Norouzi, Mohammad

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2101.05224 (eess)

[Submitted on 13 Jan 2021 (v1), last revised 1 Apr 2021 (this version, v2)]

Title:Big Self-Supervised Models Advance Medical Image Classification

Authors:Shekoofeh Azizi, Basil Mustafa, Fiona Ryan, Zachary Beaver, Jan Freyberg, Jonathan Deaton, Aaron Loh, Alan Karthikesalingam, Simon Kornblith, Ting Chen, Vivek Natarajan, Mohammad Norouzi

View PDF

Abstract:Self-supervised pretraining followed by supervised fine-tuning has seen success in image recognition, especially when labeled examples are scarce, but has received limited attention in medical image analysis. This paper studies the effectiveness of self-supervised learning as a pretraining strategy for medical image classification. We conduct experiments on two distinct tasks: dermatology skin condition classification from digital camera images and multi-label chest X-ray classification, and demonstrate that self-supervised learning on ImageNet, followed by additional self-supervised learning on unlabeled domain-specific medical images significantly improves the accuracy of medical image classifiers. We introduce a novel Multi-Instance Contrastive Learning (MICLe) method that uses multiple images of the underlying pathology per patient case, when available, to construct more informative positive pairs for self-supervised learning. Combining our contributions, we achieve an improvement of 6.7% in top-1 accuracy and an improvement of 1.1% in mean AUC on dermatology and chest X-ray classification respectively, outperforming strong supervised baselines pretrained on ImageNet. In addition, we show that big self-supervised models are robust to distribution shift and can learn efficiently with a small number of labeled medical images.

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2101.05224 [eess.IV]
	(or arXiv:2101.05224v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2101.05224

Submission history

From: Shekoofeh Azizi [view email]
[v1] Wed, 13 Jan 2021 17:36:31 UTC (17,109 KB)
[v2] Thu, 1 Apr 2021 17:43:59 UTC (16,956 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Big Self-Supervised Models Advance Medical Image Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Big Self-Supervised Models Advance Medical Image Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators