Squeeze-and-Attention Networks for Semantic Segmentation

Zhong, Zilong; Lin, Zhong Qiu; Bidart, Rene; Hu, Xiaodan; Daya, Ibrahim Ben; Li, Jonathan; Wong, Alexander

arXiv:1909.03402v2 (cs)

[Submitted on 8 Sep 2019 (v1), revised 10 Sep 2019 (this version, v2), latest version 1 Apr 2020 (v4)]

Abstract: The squeeze-and-excitation (SE) module enhances the representational power of convolutional layers by adaptively recalibrating channel-wise feature responses. However, SE attention discards spatial information cues, which makes it less well suited to perception tasks with very high spatial inter-dependencies, such as semantic segmentation. In this paper, we propose a novel squeeze-and-attention network (SANet) architecture that leverages a simple but effective squeeze-and-attention (SA) module to account for two distinctive characteristics of segmentation: i) pixel-group attention, and ii) pixel-wise prediction. Specifically, the proposed SA modules impose pixel-group attention on conventional convolution by introducing an 'attention' convolutional channel, thus taking spatial-channel inter-dependencies into account efficiently. The final segmentation results are produced by merging the outputs of four hierarchical stages of a SANet, which integrates multi-scale contexts to enhance pixel-wise prediction. Experiments on two challenging public datasets validate the effectiveness of the proposed SANets, which achieved 83.2% mIoU (without COCO pre-training) on PASCAL VOC and a state-of-the-art mIoU of 54.4% on PASCAL Context.
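
The contrast the abstract draws between channel-wise recalibration and attention that retains spatial cues can be illustrated with a small toy sketch. This is not the authors' SA module: the abstract does not specify its layers, so the function names `se_gate` and `pooled_spatial_gate`, the sigmoid gate, the 2x2 block pooling, and the omission of all learned weights are assumptions made purely for illustration.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def se_gate(feature):
    """SE-style gating: one scalar per channel from global average pooling.

    `feature` is a list of channels, each a 2D list of floats. The learned
    fully connected layers of a real SE block are omitted (assumption).
    """
    out = []
    for channel in feature:
        h, w = len(channel), len(channel[0])
        mean = sum(sum(row) for row in channel) / (h * w)
        g = sigmoid(mean)  # the same gate is applied at every pixel
        out.append([[g * v for v in row] for row in channel])
    return out


def pooled_spatial_gate(feature, k=2):
    """Hypothetical gate that keeps a coarse spatial grid.

    Each k x k block is average-pooled, passed through a sigmoid, and the
    resulting gate is applied to every pixel in that block (nearest-neighbour
    upsampling). Learned convolutions are omitted (assumption).
    """
    out = []
    for channel in feature:
        h, w = len(channel), len(channel[0])
        gated = [[0.0] * w for _ in range(h)]
        for i0 in range(0, h, k):
            for j0 in range(0, w, k):
                rows = range(i0, min(i0 + k, h))
                cols = range(j0, min(j0 + k, w))
                block = [channel[i][j] for i in rows for j in cols]
                g = sigmoid(sum(block) / len(block))
                for i in rows:
                    for j in cols:
                        gated[i][j] = g * channel[i][j]
        out.append(gated)
    return out


# One channel whose left half is +2 and right half is -2 (global mean is 0).
feature = [[[2.0, 2.0, -2.0, -2.0] for _ in range(4)]]

se_out = se_gate(feature)
sp_out = pooled_spatial_gate(feature, k=2)

# The SE-style gate is 0.5 everywhere, so left and right are scaled equally.
print(se_out[0][0][0], se_out[0][0][3])
# The block-wise gate is about 0.88 on the left and about 0.12 on the right.
print(sp_out[0][0][0] / 2.0, sp_out[0][0][3] / -2.0)
```

In this toy input the global pool averages the two halves away, so the channel-wise gate cannot tell them apart, whereas the block-wise gate responds differently to each region. That is the kind of spatial cue the abstract says is lost under SE.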

Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1909.03402 [cs.CV]
(or arXiv:1909.03402v2 [cs.CV] for this version)
https://doi.org/10.48550/arXiv.1909.03402

Submission history

From: Zilong Zhong
[v1] Sun, 8 Sep 2019 08:21:57 UTC (2,191 KB)
[v2] Tue, 10 Sep 2019 03:38:20 UTC (2,191 KB)
[v3] Thu, 27 Feb 2020 07:14:17 UTC (2,623 KB)
[v4] Wed, 1 Apr 2020 05:50:33 UTC (2,645 KB)
