Facial optical flow estimation via neural non-rigid registration

Optical flow estimation in human facial video, which provides 2D correspondences between adjacent frames, is a fundamental pre-processing step for many applications, like facial expression capture and recognition. However, it is quite challenging as human facial images contain large areas of similar...

Full description

Saved in:
Bibliographic Details
Published inComputational visual media (Beijing) Vol. 9; no. 1; pp. 109 - 122
Main Authors Peng, Zhuang, Jiang, Boyi, Xu, Haofei, Feng, Wanquan, Zhang, Juyong
Format Journal Article
LanguageEnglish
Published Beijing Tsinghua University Press 01.03.2023
Springer Nature B.V
SpringerOpen
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Optical flow estimation in human facial video, which provides 2D correspondences between adjacent frames, is a fundamental pre-processing step for many applications, like facial expression capture and recognition. However, it is quite challenging as human facial images contain large areas of similar textures, rich expressions, and large rotations. These characteristics also result in the scarcity of large, annotated real-world datasets. We propose a robust and accurate method to learn facial optical flow in a self-supervised manner. Specifically, we utilize various shape priors, including face depth, landmarks, and parsing, to guide the self-supervised learning task via a differentiable nonrigid registration framework. Extensive experiments demonstrate that our method achieves remarkable improvements for facial optical flow estimation in the presence of significant expressions and large rotations.
ISSN:2096-0433
2096-0662
DOI:10.1007/s41095-021-0267-z