Dense 3D face alignment from 2D video for real-time use

To enable real-time, person-independent 3D registration from 2D video, we developed a 3D cascade regression approach in which facial landmarks remain invariant across pose over a range of approximately 60°. From a single 2D image of a person's face, a dense 3D shape is registered in real time f...

Full description

Saved in:
Bibliographic Details
Published inImage and vision computing Vol. 58; pp. 13 - 24
Main Authors Jeni, László A., Cohn, Jeffrey F., Kanade, Takeo
Format Journal Article
LanguageEnglish
Published England Elsevier B.V 01.02.2017
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:To enable real-time, person-independent 3D registration from 2D video, we developed a 3D cascade regression approach in which facial landmarks remain invariant across pose over a range of approximately 60°. From a single 2D image of a person's face, a dense 3D shape is registered in real time for each frame. The algorithm utilizes a fast cascade regression framework trained on high-resolution 3D face-scans of posed and spontaneous emotion expression. The algorithm first estimates the location of a dense set of landmarks and their visibility, then reconstructs face shapes by fitting a part-based 3D model. Because no assumptions are required about illumination or surface properties, the method can be applied to a wide range of imaging conditions that include 2D video and uncalibrated multi-view video. The method has been validated in a battery of experiments that evaluate its precision of 3D reconstruction, extension to multi-view reconstruction, temporal integration for videos and 3D head-pose estimation. Experimental findings strongly support the validity of real-time, 3D registration and reconstruction from 2D video. The software is available online at http://zface.org. [Display omitted] •3D cascade regression approach is proposed in which facial landmarks remain invariant.•From a single 2D image of a person's face, a dense 3D shape is registered in real time for each frame.•Multi-view reconstruction and temporal integration for videos are presented.•Method is robust for 3D head-pose estimation under various conditions.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0262-8856
1872-8138
DOI:10.1016/j.imavis.2016.05.009