3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network

In this paper, we propose a deep convolutional neural network for 3D human pose estimation from monocular images. We train the network using two strategies: (1) a multi-task framework that jointly trains pose regression and body part detectors; (2) a pre-training strategy where the pose regressor is...

Full description

Saved in:
Bibliographic Details
Published inComputer Vision -- ACCV 2014 pp. 332 - 347
Main Authors Li, Sijin, Chan, Antoni B.
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing 2015
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In this paper, we propose a deep convolutional neural network for 3D human pose estimation from monocular images. We train the network using two strategies: (1) a multi-task framework that jointly trains pose regression and body part detectors; (2) a pre-training strategy where the pose regressor is initialized using a network trained for body part detection. We compare our network on a large data set and achieve significant improvement over baseline methods. Human pose estimation is a structured prediction problem, i.e., the locations of each body part are highly correlated. Although we do not add constraints about the correlations between body parts to the network, we empirically show that the network has disentangled the dependencies among different body parts, and learned their correlations.
Bibliography:Electronic supplementary materialThe online version of this chapter (doi:10.1007/978-3-319-16808-1_23) contains supplementary material, which is available to authorized users. Videos can also be accessed at http://www.springerimages.com/videos/978-3-319-16807-4.
ISBN:9783319168074
331916807X
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-319-16808-1_23