Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach

Yan, Min; Ning, Qianxiong; Wang, Qian

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.03508 (cs)

[Submitted on 6 Jun 2023]

Title:Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach

Authors:Min Yan, Qianxiong Ning, Qian Wang

View PDF

Abstract:Video scene parsing incorporates temporal information, which can enhance the consistency and accuracy of predictions compared to image scene parsing. The added temporal dimension enables a more comprehensive understanding of the scene, leading to more reliable results. This paper presents the winning solution of the CVPR2023 workshop for video semantic segmentation, focusing on enhancing Spatial-Temporal correlations with contrastive loss. We also explore the influence of multi-dataset training by utilizing a label-mapping technique. And the final result is aggregating the output of the above two models. Our approach achieves 65.95% mIoU performance on the VSPW dataset, ranked 1st place on the VSPW challenge at CVPR 2023.

Comments:	1st Place Solution for CVPR 2023 PVUW VSS Track
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.03508 [cs.CV]
	(or arXiv:2306.03508v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.03508

Submission history

From: Qianxiong Ning [view email]
[v1] Tue, 6 Jun 2023 08:53:53 UTC (20 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2023-06

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators