Dynamic Convolution: Attention over Convolution Kernels

Chen, Yinpeng; Dai, Xiyang; Liu, Mengchen; Chen, Dongdong; Yuan, Lu; Liu, Zicheng

arXiv:1912.03458 (cs)

[Submitted on 7 Dec 2019 (v1), last revised 31 Mar 2020 (this version, v2)]

Abstract: Light-weight convolutional neural networks (CNNs) suffer performance degradation as their low computational budgets constrain both the depth (number of convolution layers) and the width (number of channels) of CNNs, resulting in limited representation capability. To address this issue, we present Dynamic Convolution, a new design that increases model complexity without increasing the network depth or width. Instead of using a single convolution kernel per layer, dynamic convolution aggregates multiple parallel convolution kernels dynamically based upon their attentions, which are input dependent. Assembling multiple kernels is not only computationally efficient due to the small kernel size, but also has more representation power since these kernels are aggregated in a non-linear way via attention. By simply using dynamic convolution for the state-of-the-art architecture MobileNetV3-Small, the top-1 accuracy of ImageNet classification is boosted by 2.9% with only 4% additional FLOPs and 2.9 AP gain is achieved on COCO keypoint detection.
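
The aggregation described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The abstract does not specify how the attention is computed, so the sketch assumes global average pooling followed by a single linear layer and a softmax. All names (dynamic_conv2d, kernels, biases, attn_w, attn_b) and the number of kernels K are hypothetical choices for illustration.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D vector."""
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def conv2d(x, w, b):
    """Naive 'valid' 2-D cross-correlation.

    x: (C_in, H, W), w: (C_out, C_in, k, k), b: (C_out,)
    """
    c_out, c_in, k, _ = w.shape
    h_out, w_out = x.shape[1] - k + 1, x.shape[2] - k + 1
    y = np.zeros((c_out, h_out, w_out))
    for i in range(h_out):
        for j in range(w_out):
            patch = x[:, i:i + k, j:j + k]
            # Contract (C_in, k, k) of the kernel with the patch.
            y[:, i, j] = np.tensordot(w, patch, axes=([1, 2, 3], [0, 1, 2])) + b
    return y

def dynamic_conv2d(x, kernels, biases, attn_w, attn_b):
    """Aggregate K parallel kernels with input-dependent attention, then convolve once.

    kernels: (K, C_out, C_in, k, k), biases: (K, C_out)
    attn_w: (K, C_in), attn_b: (K,)  -- assumed attention parameters
    """
    pooled = x.mean(axis=(1, 2))              # assumed: global average pooling, (C_in,)
    pi = softmax(attn_w @ pooled + attn_b)    # attention weights over the K kernels
    w = np.tensordot(pi, kernels, axes=1)     # aggregated kernel, (C_out, C_in, k, k)
    b = pi @ biases                           # aggregated bias, (C_out,)
    return conv2d(x, w, b), pi

# Usage with random, purely illustrative tensors.
rng = np.random.default_rng(0)
x = rng.normal(size=(3, 8, 8))                # C_in=3, 8x8 input
kernels = rng.normal(size=(4, 5, 3, 3, 3))    # K=4 kernels, C_out=5, 3x3
biases = rng.normal(size=(4, 5))
attn_w = rng.normal(size=(4, 3))
attn_b = rng.normal(size=(4,))
y, pi = dynamic_conv2d(x, kernels, biases, attn_w, attn_b)
```

Because convolution is linear in its kernel, convolving once with the aggregated kernel gives the same result as the attention-weighted sum of K separate convolution outputs, so the added cost is mainly the attention computation and the kernel aggregation. The layer as a whole is still non-linear in the input, since the attention weights pi themselves depend on x.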

Comments:	CVPR 2020 (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1912.03458 [cs.CV]
	(or arXiv:1912.03458v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.03458

Submission history

From: Yinpeng Chen
[v1] Sat, 7 Dec 2019 07:51:35 UTC (678 KB)
[v2] Tue, 31 Mar 2020 21:56:49 UTC (856 KB)
