Speech emotion recognition using emotion perception spectral feature

Summary Speech emotion recognition is an important technique for human‐computer interface applications. Due to contain rich information of emotion, the spectral feature is widely used for emotion recognition. However, the recognition performance is limited because of imprecise extracted rule and unc...

Full description

Saved in:
Bibliographic Details
Published inConcurrency and computation Vol. 33; no. 11
Main Authors Jiang, Lin, Tan, Ping, Yang, Junfeng, Liu, Xingbao, Wang, Chao
Format Journal Article
LanguageEnglish
Published Hoboken Wiley Subscription Services, Inc 10.06.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Summary Speech emotion recognition is an important technique for human‐computer interface applications. Due to contain rich information of emotion, the spectral feature is widely used for emotion recognition. However, the recognition performance is limited because of imprecise extracted rule and uncertain size of resolution of spectral feature. To address this issue, motivated by speech coding, we introduced psychoacoustics model, provided a perception spectral subband partition method for obtaining more precise frequency resolution. Moreover, we also provided a new spectral feature on the divided subband frequency signals. The proposed feature includes emotional perception entropy, spectral inclination, and spectral flatness. Then, a Support Vector Machine classifier is used to recognize emotion categories. The experiment results show that the proposed spectral feature is superior to the traditional MFCC feature, and also better than the state‐of‐the‐art Fourier feature and multi‐resolution amplitude feature.
ISSN:1532-0626
1532-0634
DOI:10.1002/cpe.5427