VICRegL: Self-Supervised Learning of Local Visual Features

Bardes, Adrien; Ponce, Jean; LeCun, Yann

arXiv:2210.01571 (cs)

[Submitted on 4 Oct 2022]

Abstract: Most recent self-supervised methods for learning image representations focus on either producing a global feature with invariance properties, or producing a set of local features. The former works best for classification tasks while the latter is best for detection and segmentation tasks. This paper explores the fundamental trade-off between learning local and global features. A new method called VICRegL is proposed that learns good global and local features simultaneously, yielding excellent performance on detection and segmentation tasks while maintaining good performance on classification tasks. Concretely, two identical branches of a standard convolutional net architecture are fed two differently distorted versions of the same image. The VICReg criterion is applied to pairs of global feature vectors. Simultaneously, the VICReg criterion is applied to pairs of local feature vectors occurring before the last pooling layer. Two local feature vectors are attracted to each other if their l2-distance is below a threshold or if their relative locations are consistent with a known geometric transformation between the two input images. We demonstrate strong performance on linear classification and segmentation transfer tasks. Code and pretrained models are publicly available at: this https URL

Comments:	Accepted at NeurIPS 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2210.01571 [cs.CV]
	(or arXiv:2210.01571v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.01571

Submission history

From: Adrien Bardes
[v1] Tue, 4 Oct 2022 12:54:25 UTC (12,934 KB)
