An Ensemble Framework of Voice-Based Emotion Recognition System

Emotion recognition will improve the user experience of artificial intelligence (AI) product. Voice-based emotion recognition system has low hardware requirement and can be easily employed onto current AI product, e.g. Amazon Echo, Google Home and so on. The previous voicebased emotion recognition s...

Full description

Saved in:

Bibliographic Details
Published in	2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia) pp. 1 - 6
Main Authors	Tao, Fei, Liu, Gang, Zhao, Qingen
Format	Conference Proceeding
Language	English
Published	IEEE 01.05.2018
Subjects	Acoustics attention model deep learning Emotion recognition ensemble framework Feature extraction Hidden Markov models multi-task learning Neurons Speech recognition Task analysis
Online Access	Get full text

Cover

Loading…

Abstract	Emotion recognition will improve the user experience of artificial intelligence (AI) product. Voice-based emotion recognition system has low hardware requirement and can be easily employed onto current AI product, e.g. Amazon Echo, Google Home and so on. The previous voicebased emotion recognition system mostly focused on the speech collected under controlled environment, e.g. studio environment. However, the practical scenario will be more complicated than controlled environment. Only focusing on acoustic characteristics of speech is not sufficient to model the emotion. In this paper, we propose a ensemble framework which can capture several aspects of characteristics related to emotion. The framework is evaluated on multimodal emotion challenge (MEC) 2017 corpus. The corpus is collected from Chinese films and TV programs, whose scenarios are close to real world. The proposed framework is able to outperform the best baseline system by 29.5% absolutely.
AbstractList	Emotion recognition will improve the user experience of artificial intelligence (AI) product. Voice-based emotion recognition system has low hardware requirement and can be easily employed onto current AI product, e.g. Amazon Echo, Google Home and so on. The previous voicebased emotion recognition system mostly focused on the speech collected under controlled environment, e.g. studio environment. However, the practical scenario will be more complicated than controlled environment. Only focusing on acoustic characteristics of speech is not sufficient to model the emotion. In this paper, we propose a ensemble framework which can capture several aspects of characteristics related to emotion. The framework is evaluated on multimodal emotion challenge (MEC) 2017 corpus. The corpus is collected from Chinese films and TV programs, whose scenarios are close to real world. The proposed framework is able to outperform the best baseline system by 29.5% absolutely.
Author	Tao, Fei Liu, Gang Zhao, Qingen
Author_xml	– sequence: 1 givenname: Fei surname: Tao fullname: Tao, Fei organization: Multimodal Signal Processing (MSP) Lab, The University of Texas at Dallas, TX, Richardson – sequence: 2 givenname: Gang surname: Liu fullname: Liu, Gang organization: Alibaba Group – sequence: 3 givenname: Qingen surname: Zhao fullname: Zhao, Qingen organization: Alibaba Group
BookMark	eNotj8tKxDAUQCPoQsf5AhHyA603SfNaSS0dLQwIvrZDmrmR4CSRtiDz94LO6pzVgXNFznPJSMgtg5oxsHdtNwztHF3NgZnaNBoEN2dkbbVhUhglBWPskty3mfZ5xjQekG4ml_CnTF-0BPpRosfqwc24p30qSyyZvqAvnzn--etxXjBdk4vgDjOuT1yR903_1j1V2-fHoWu3lWcNX6pgmLJaggSFBoJXQoPlCtB5IYX3jjWjQgUyANd71TjBAccRrNGK2zCKFbn570ZE3H1PMbnpuDttiV9OekZR
CitedBy_id	crossref_primary_10_3389_fpsyg_2023_1129406
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/ACIIAsia.2018.8470328
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
EISBN	9781538653111 1538653117
EndPage	6
ExternalDocumentID	8470328
Genre	orig-research
GroupedDBID	6IE 6IL CBEJK RIE RIL
ID	FETCH-LOGICAL-c142t-f8169750506e80fc63709260eac353cca14b6e605f027d64a320ebb0987629fb3
IEDL.DBID	RIE
IngestDate	Thu Jun 29 18:39:34 EDT 2023
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c142t-f8169750506e80fc63709260eac353cca14b6e605f027d64a320ebb0987629fb3
PageCount	6
ParticipantIDs	ieee_primary_8470328
PublicationCentury	2000
PublicationDate	2018-May
PublicationDateYYYYMMDD	2018-05-01
PublicationDate_xml	– month: 05 year: 2018 text: 2018-May
PublicationDecade	2010
PublicationTitle	2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia)
PublicationTitleAbbrev	ACIIAsia
PublicationYear	2018
Publisher	IEEE
Publisher_xml	– name: IEEE
Score	1.7674036
Snippet	Emotion recognition will improve the user experience of artificial intelligence (AI) product. Voice-based emotion recognition system has low hardware...
SourceID	ieee
SourceType	Publisher
StartPage	1
SubjectTerms	Acoustics attention model deep learning Emotion recognition ensemble framework Feature extraction Hidden Markov models multi-task learning Neurons Speech recognition Task analysis
Title	An Ensemble Framework of Voice-Based Emotion Recognition System
URI	https://ieeexplore.ieee.org/document/8470328
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA7bTp5UNvE3OXg0XdImaXOSOTY2YSLiZLeRtC8gajtcd_GvN2m7ieLBWwiBJDx4Hy_5vu8hdCWVlTy2gjArJXGID8SozHgjZA5xqBWPvRp5di8nc363EIsWut5pYQCgIp9B4IfVX35WpBv_VNZ3mdTbv7VRO6FhrdVqRDmMqv5gOJ0O1i_eTIglQbP2R9OUCjPG-2i23a2mirwGm9IE6ecvI8b_HucA9b7VefhhhzuHqAV5F90McjzK1_Bu3gCPt4wrXFj8XLhcQG4dWmV4VDftwY9b2pAb157lPTQfj56GE9I0RyAp42FJbMKkcnAvqISE2lRGMVWuOHGJNBKRiwvjRoIrVqwrPDPJdRRSMIYqn_6UNdER6uRFDscICyM0A6O1MZn3b0-0kIam4GsjHSl-grr-8stV7X-xbO59-vf0GdrzAahJgeeoU35s4MIBd2kuq4h9AQG_mUw
link.rule.ids	310,311,786,790,795,796,802,27956,55107
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFH7MedCTyib-NgePtkvXJG1OMsfGptsQ2WS30bQvIGorrrv415u03UTx4C2EQBIevI-XfN_3AK6E1IIFmjueFsIxiI-OkomyRsgMg3YkWWDVyOOJGMzY3ZzPa3C90cIgYkE-Q9cOi7_8JItX9qmsZTKptX_bgm2D8zQo1VqVLMejstXpDoed5bO1E_JCt1r9o21KgRr9PRiv9yvJIi_uKldu_PnLivG_B9qH5rc-jzxskOcAapg24KaTkl66xDf1iqS_5lyRTJOnzGQD59bgVUJ6Zdse8rgmDplx6VrehFm_N-0OnKo9ghN7rJ07OvSENIDPqcCQ6lj4AZWmPDGp1Oe-iYzHlEBTrmhTeiaCRX6bolJU2gQotfIPoZ5mKR4B4YpHHqooUiqxDu5hxIWiMdrqKPIlO4aGvfzivXTAWFT3Pvl7-hJ2BtPxaDEaTu5PYdcGo6QInkE9_1jhuYHxXF0U0fsCRu6coA
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2018+First+Asian+Conference+on+Affective+Computing+and+Intelligent+Interaction+%28ACII+Asia%29&rft.atitle=An+Ensemble+Framework+of+Voice-Based+Emotion+Recognition+System&rft.au=Tao%2C+Fei&rft.au=Liu%2C+Gang&rft.au=Zhao%2C+Qingen&rft.date=2018-05-01&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FACIIAsia.2018.8470328&rft.externalDocID=8470328