Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition

Micro-expressions can convey feelings that people are trying to hide. At present, some studies on micro-expression, most of which only use the temporal or spatial information in the image to recognize micro-expressions, neglect the intrinsic features of the image. To solve this problem, we detect th...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of multimedia information retrieval Vol. 11; no. 4; pp. 553 - 566
Main Authors Zhang, Ren, He, Ning, Liu, Shengjie, Wu, Ying, Yan, Kang, He, Yuzhe, Lu, Ke
Format Journal Article
LanguageEnglish
Published London Springer London 01.12.2022
Springer Nature B.V
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Micro-expressions can convey feelings that people are trying to hide. At present, some studies on micro-expression, most of which only use the temporal or spatial information in the image to recognize micro-expressions, neglect the intrinsic features of the image. To solve this problem, we detect the subject’s heart rate in the long micro-expression videos; we extract the image’s spatio-temporal feature through a spatio-temporal network and then extract the heart rate feature using a heart rate network. A multimodal learning method that combines the heart rate and spatio-temporal features is used to recognize micro-expressions. The experimental results on CASMEII , SAMM , and SMIC show that the proposed methods’ unweighted F1-score and unweighted average recall are 0.8867 and 0.8962, respectively. The spatio-temporal fusion network combined with heart rate information provides an essential reference for multimodal approaches to micro-expression recognition.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2192-6611
2192-662X
DOI:10.1007/s13735-022-00250-9