Convolutional Pose Machines

Wei, Shih-En; Ramakrishna, Varun; Kanade, Takeo; Sheikh, Yaser

Computer Science > Computer Vision and Pattern Recognition

arXiv:1602.00134 (cs)

[Submitted on 30 Jan 2016 (v1), last revised 12 Apr 2016 (this version, v4)]

Title:Convolutional Pose Machines

Authors:Shih-En Wei, Varun Ramakrishna, Takeo Kanade, Yaser Sheikh

View PDF

Abstract:Pose Machines provide a sequential prediction framework for learning rich implicit spatial models. In this work we show a systematic design for how convolutional networks can be incorporated into the pose machine framework for learning image features and image-dependent spatial models for the task of pose estimation. The contribution of this paper is to implicitly model long-range dependencies between variables in structured prediction tasks such as articulated pose estimation. We achieve this by designing a sequential architecture composed of convolutional networks that directly operate on belief maps from previous stages, producing increasingly refined estimates for part locations, without the need for explicit graphical model-style inference. Our approach addresses the characteristic difficulty of vanishing gradients during training by providing a natural learning objective function that enforces intermediate supervision, thereby replenishing back-propagated gradients and conditioning the learning procedure. We demonstrate state-of-the-art performance and outperform competing methods on standard benchmarks including the MPII, LSP, and FLIC datasets.

Comments:	camera ready
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1602.00134 [cs.CV]
	(or arXiv:1602.00134v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1602.00134

Submission history

From: Shih-En Wei [view email]
[v1] Sat, 30 Jan 2016 16:15:28 UTC (5,133 KB)
[v2] Tue, 2 Feb 2016 04:58:41 UTC (5,133 KB)
[v3] Mon, 28 Mar 2016 10:22:17 UTC (7,091 KB)
[v4] Tue, 12 Apr 2016 03:31:53 UTC (11,781 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Convolutional Pose Machines

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Convolutional Pose Machines

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators