Speech emotion recognition using emotion perception spectral feature

Summary Speech emotion recognition is an important technique for human‐computer interface applications. Due to contain rich information of emotion, the spectral feature is widely used for emotion recognition. However, the recognition performance is limited because of imprecise extracted rule and unc...

Full description

Saved in:

Bibliographic Details
Published in	Concurrency and computation Vol. 33; no. 11
Main Authors	Jiang, Lin, Tan, Ping, Yang, Junfeng, Liu, Xingbao, Wang, Chao
Format	Journal Article
Language	English
Published	Hoboken Wiley Subscription Services, Inc 10.06.2021
Subjects	emotion perception Emotion recognition Emotions Feature extraction Feature recognition Perception perception subband partition Psychoacoustics Spectra spectral feature Speech recognition Support vector machines
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Summary Speech emotion recognition is an important technique for human‐computer interface applications. Due to contain rich information of emotion, the spectral feature is widely used for emotion recognition. However, the recognition performance is limited because of imprecise extracted rule and uncertain size of resolution of spectral feature. To address this issue, motivated by speech coding, we introduced psychoacoustics model, provided a perception spectral subband partition method for obtaining more precise frequency resolution. Moreover, we also provided a new spectral feature on the divided subband frequency signals. The proposed feature includes emotional perception entropy, spectral inclination, and spectral flatness. Then, a Support Vector Machine classifier is used to recognize emotion categories. The experiment results show that the proposed spectral feature is superior to the traditional MFCC feature, and also better than the state‐of‐the‐art Fourier feature and multi‐resolution amplitude feature.
ISSN:	1532-0626 1532-0634
DOI:	10.1002/cpe.5427