On Efficient Variants of Segment Anything Model: A Survey

Sun, Xiaorui; Liu, Jun; Shen, Heng Tao; Zhu, Xiaofeng; Hu, Ping

Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.04960 (cs)

[Submitted on 7 Oct 2024 (v1), last revised 18 Oct 2024 (this version, v2)]

Title:On Efficient Variants of Segment Anything Model: A Survey

Authors:Xiaorui Sun, Jun Liu, Heng Tao Shen, Xiaofeng Zhu, Ping Hu

View PDF HTML (experimental)

Abstract:The Segment Anything Model (SAM) is a foundational model for image segmentation tasks, known for its strong generalization across diverse applications. However, its impressive performance comes with significant computational and resource demands, making it challenging to deploy in resource-limited environments such as edge devices. To address this, a variety of SAM variants have been proposed to enhance efficiency while keeping accuracy. This survey provides the first comprehensive review of these efficient SAM variants. We begin by exploring the motivations driving this research. We then present core techniques used in SAM and model acceleration. This is followed by a detailed exploration of SAM acceleration strategies, categorized by approach, and a discussion of several future research directions. Finally, we offer a unified and extensive evaluation of these methods across various hardware, assessing their efficiency and accuracy on representative benchmarks, and providing a clear comparison of their overall performance.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.04960 [cs.CV]
	(or arXiv:2410.04960v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.04960

Submission history

From: Ping Hu [view email]
[v1] Mon, 7 Oct 2024 11:59:54 UTC (8,546 KB)
[v2] Fri, 18 Oct 2024 14:42:50 UTC (6,117 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:On Efficient Variants of Segment Anything Model: A Survey

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:On Efficient Variants of Segment Anything Model: A Survey

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators