An Ensemble Framework of Voice-Based Emotion Recognition System

Emotion recognition will improve the user experience of artificial intelligence (AI) product. Voice-based emotion recognition system has low hardware requirement and can be easily employed onto current AI product, e.g. Amazon Echo, Google Home and so on. The previous voicebased emotion recognition s...

Full description

Saved in:
Bibliographic Details
Published in2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia) pp. 1 - 6
Main Authors Tao, Fei, Liu, Gang, Zhao, Qingen
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.05.2018
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Emotion recognition will improve the user experience of artificial intelligence (AI) product. Voice-based emotion recognition system has low hardware requirement and can be easily employed onto current AI product, e.g. Amazon Echo, Google Home and so on. The previous voicebased emotion recognition system mostly focused on the speech collected under controlled environment, e.g. studio environment. However, the practical scenario will be more complicated than controlled environment. Only focusing on acoustic characteristics of speech is not sufficient to model the emotion. In this paper, we propose a ensemble framework which can capture several aspects of characteristics related to emotion. The framework is evaluated on multimodal emotion challenge (MEC) 2017 corpus. The corpus is collected from Chinese films and TV programs, whose scenarios are close to real world. The proposed framework is able to outperform the best baseline system by 29.5% absolutely.
AbstractList Emotion recognition will improve the user experience of artificial intelligence (AI) product. Voice-based emotion recognition system has low hardware requirement and can be easily employed onto current AI product, e.g. Amazon Echo, Google Home and so on. The previous voicebased emotion recognition system mostly focused on the speech collected under controlled environment, e.g. studio environment. However, the practical scenario will be more complicated than controlled environment. Only focusing on acoustic characteristics of speech is not sufficient to model the emotion. In this paper, we propose a ensemble framework which can capture several aspects of characteristics related to emotion. The framework is evaluated on multimodal emotion challenge (MEC) 2017 corpus. The corpus is collected from Chinese films and TV programs, whose scenarios are close to real world. The proposed framework is able to outperform the best baseline system by 29.5% absolutely.
Author Tao, Fei
Liu, Gang
Zhao, Qingen
Author_xml – sequence: 1
  givenname: Fei
  surname: Tao
  fullname: Tao, Fei
  organization: Multimodal Signal Processing (MSP) Lab, The University of Texas at Dallas, TX, Richardson
– sequence: 2
  givenname: Gang
  surname: Liu
  fullname: Liu, Gang
  organization: Alibaba Group
– sequence: 3
  givenname: Qingen
  surname: Zhao
  fullname: Zhao, Qingen
  organization: Alibaba Group
BookMark eNotj8tKxDAUQCPoQsf5AhHyA603SfNaSS0dLQwIvrZDmrmR4CSRtiDz94LO6pzVgXNFznPJSMgtg5oxsHdtNwztHF3NgZnaNBoEN2dkbbVhUhglBWPskty3mfZ5xjQekG4ml_CnTF-0BPpRosfqwc24p30qSyyZvqAvnzn--etxXjBdk4vgDjOuT1yR903_1j1V2-fHoWu3lWcNX6pgmLJaggSFBoJXQoPlCtB5IYX3jjWjQgUyANd71TjBAccRrNGK2zCKFbn570ZE3H1PMbnpuDttiV9OekZR
CitedBy_id crossref_primary_10_3389_fpsyg_2023_1129406
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ACIIAsia.2018.8470328
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Xplore
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781538653111
1538653117
EndPage 6
ExternalDocumentID 8470328
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-c142t-f8169750506e80fc63709260eac353cca14b6e605f027d64a320ebb0987629fb3
IEDL.DBID RIE
IngestDate Thu Jun 29 18:39:34 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c142t-f8169750506e80fc63709260eac353cca14b6e605f027d64a320ebb0987629fb3
PageCount 6
ParticipantIDs ieee_primary_8470328
PublicationCentury 2000
PublicationDate 2018-May
PublicationDateYYYYMMDD 2018-05-01
PublicationDate_xml – month: 05
  year: 2018
  text: 2018-May
PublicationDecade 2010
PublicationTitle 2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia)
PublicationTitleAbbrev ACIIAsia
PublicationYear 2018
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.7674036
Snippet Emotion recognition will improve the user experience of artificial intelligence (AI) product. Voice-based emotion recognition system has low hardware...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Acoustics
attention model
deep learning
Emotion recognition
ensemble framework
Feature extraction
Hidden Markov models
multi-task learning
Neurons
Speech recognition
Task analysis
Title An Ensemble Framework of Voice-Based Emotion Recognition System
URI https://ieeexplore.ieee.org/document/8470328
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA7bTp5UNvE3OXg0XdImaXOSOTY2YSLiZLeRtC8gajtcd_GvN2m7ieLBWwiBJDx4Hy_5vu8hdCWVlTy2gjArJXGID8SozHgjZA5xqBWPvRp5di8nc363EIsWut5pYQCgIp9B4IfVX35WpBv_VNZ3mdTbv7VRO6FhrdVqRDmMqv5gOJ0O1i_eTIglQbP2R9OUCjPG-2i23a2mirwGm9IE6ecvI8b_HucA9b7VefhhhzuHqAV5F90McjzK1_Bu3gCPt4wrXFj8XLhcQG4dWmV4VDftwY9b2pAb157lPTQfj56GE9I0RyAp42FJbMKkcnAvqISE2lRGMVWuOHGJNBKRiwvjRoIrVqwrPDPJdRRSMIYqn_6UNdER6uRFDscICyM0A6O1MZn3b0-0kIam4GsjHSl-grr-8stV7X-xbO59-vf0GdrzAahJgeeoU35s4MIBd2kuq4h9AQG_mUw
link.rule.ids 310,311,786,790,795,796,802,27956,55107
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFH7MedCTyib-NgePtkvXJG1OMsfGptsQ2WS30bQvIGorrrv415u03UTx4C2EQBIevI-XfN_3AK6E1IIFmjueFsIxiI-OkomyRsgMg3YkWWDVyOOJGMzY3ZzPa3C90cIgYkE-Q9cOi7_8JItX9qmsZTKptX_bgm2D8zQo1VqVLMejstXpDoed5bO1E_JCt1r9o21KgRr9PRiv9yvJIi_uKldu_PnLivG_B9qH5rc-jzxskOcAapg24KaTkl66xDf1iqS_5lyRTJOnzGQD59bgVUJ6Zdse8rgmDplx6VrehFm_N-0OnKo9ghN7rJ07OvSENIDPqcCQ6lj4AZWmPDGp1Oe-iYzHlEBTrmhTeiaCRX6bolJU2gQotfIPoZ5mKR4B4YpHHqooUiqxDu5hxIWiMdrqKPIlO4aGvfzivXTAWFT3Pvl7-hJ2BtPxaDEaTu5PYdcGo6QInkE9_1jhuYHxXF0U0fsCRu6coA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2018+First+Asian+Conference+on+Affective+Computing+and+Intelligent+Interaction+%28ACII+Asia%29&rft.atitle=An+Ensemble+Framework+of+Voice-Based+Emotion+Recognition+System&rft.au=Tao%2C+Fei&rft.au=Liu%2C+Gang&rft.au=Zhao%2C+Qingen&rft.date=2018-05-01&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FACIIAsia.2018.8470328&rft.externalDocID=8470328