An Ensemble Framework of Voice-Based Emotion Recognition System
Published in | 2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia) pp. 1 - 6 |
---|---|
Main Authors | Tao, Fei; Liu, Gang; Zhao, Qingen |
Format | Conference Proceeding |
Language | English |
Published | IEEE, 01.05.2018 |
Subjects | |
Abstract | Emotion recognition will improve the user experience of artificial intelligence (AI) products. A voice-based emotion recognition system has low hardware requirements and can be easily deployed on current AI products, e.g. Amazon Echo and Google Home. Previous voice-based emotion recognition systems mostly focused on speech collected in controlled environments, e.g. a studio. However, practical scenarios are more complicated than controlled environments, and focusing only on the acoustic characteristics of speech is not sufficient to model emotion. In this paper, we propose an ensemble framework that captures several aspects of emotion-related characteristics. The framework is evaluated on the Multimodal Emotion Challenge (MEC) 2017 corpus, which was collected from Chinese films and TV programs whose scenarios are close to the real world. The proposed framework outperforms the best baseline system by 29.5% absolute. |
---|---|
Author | Tao, Fei; Liu, Gang; Zhao, Qingen |
Author_xml | – sequence: 1 givenname: Fei surname: Tao fullname: Tao, Fei organization: Multimodal Signal Processing (MSP) Lab, The University of Texas at Dallas, TX, Richardson – sequence: 2 givenname: Gang surname: Liu fullname: Liu, Gang organization: Alibaba Group – sequence: 3 givenname: Qingen surname: Zhao fullname: Zhao, Qingen organization: Alibaba Group |
CitedBy_id | crossref_primary_10_3389_fpsyg_2023_1129406 |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/ACIIAsia.2018.8470328 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings; IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume; IEEE Xplore All Conference Proceedings; IEEE Xplore; IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9781538653111; 1538653117 |
EndPage | 6 |
ExternalDocumentID | 8470328 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:39:34 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
PageCount | 6 |
ParticipantIDs | ieee_primary_8470328 |
PublicationCentury | 2000 |
PublicationDate | 2018-May |
PublicationDateYYYYMMDD | 2018-05-01 |
PublicationDate_xml | – month: 05 year: 2018 text: 2018-May |
PublicationDecade | 2010 |
PublicationTitle | 2018 First Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia) |
PublicationTitleAbbrev | ACIIAsia |
PublicationYear | 2018 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.7674036 |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1 |
SubjectTerms | Acoustics; attention model; deep learning; Emotion recognition; ensemble framework; Feature extraction; Hidden Markov models; multi-task learning; Neurons; Speech recognition; Task analysis |
Title | An Ensemble Framework of Voice-Based Emotion Recognition System |
URI | https://ieeexplore.ieee.org/document/8470328 |
hasFullText | 1 |
inHoldings | 1 |