Accuracy improvement of facial expression recognition in speech acts: Confirmation of effectiveness of information around a mouth and GAN-based data augmentation

Bibliographic Details
Published in IEEE RO-MAN pp. 1-6
Main Authors Song, Kyu-Seob; Kwon, Dong-Soo
Format Conference Proceeding
Language English
Published IEEE 01.10.2019
ISSN 1944-9437
DOI 10.1109/RO-MAN46459.2019.8956436

Abstract With the growth of the social robot market, much research has been devoted to facial expression recognition, an important function of a social robot. Facial expression recognition models perform well on facial expression image datasets in which emotion is expressed without speech. In reality, however, humans often express emotions while speaking, which moves the muscles around the mouth. Ignoring speech therefore leads to unsatisfactory emotion recognition results. In this paper, we investigate two points to consider when training a facial expression recognition model. First, we examine whether the information around the mouth leads the recognition model to misrecognition in speech acts, as in the case of facial expression recognition in non-speech acts, or whether it carries valid information for facial expression recognition. Second, we apply Generative Adversarial Network (GAN)-based data augmentation to address the low accuracy of the recognition model in speech acts, which results from the relatively small variance across subjects in the RML dataset. The results show that, unlike in non-speech acts, the information around the mouth improves the performance of facial expression recognition in speech acts. In addition, the GAN-based data augmentation alleviates the accuracy degradation in facial expression recognition caused by the low variance of the dataset.
Author Song, Kyu-Seob
Kwon, Dong-Soo
Author_xml – sequence: 1
  givenname: Kyu-Seob
  surname: Song
  fullname: Song, Kyu-Seob
  organization: Korea Advanced Institute of Science and Technology,Human-Robot Interaction Research Center,The Republic of Korea
– sequence: 2
  givenname: Dong-Soo
  surname: Kwon
  fullname: Kwon, Dong-Soo
  organization: Korea Advanced Institute of Science and Technology,Human-Robot Interaction Research Center,The Republic of Korea
ContentType Conference Proceeding
DOI 10.1109/RO-MAN46459.2019.8956436
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
Discipline Engineering
EISBN 9781728126227
1728126223
EISSN 1944-9437
EndPage 6
ExternalDocumentID 8956436
Genre orig-research
IEDL.DBID RIE
IngestDate Wed Aug 06 17:54:56 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
PageCount 6
ParticipantIDs ieee_primary_8956436
PublicationCentury 2000
PublicationDate 2019-Oct.
PublicationDateYYYYMMDD 2019-10-01
PublicationDate_xml – month: 10
  year: 2019
  text: 2019-Oct.
PublicationDecade 2010
PublicationTitle IEEE RO-MAN
PublicationTitleAbbrev ROMAN
PublicationYear 2019
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0000941175
Score 2.0980198
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Accuracy
Data augmentation
Emotion recognition
Face recognition
Mouth
Muscles
Social robots
Speech
Speech recognition
Training
Title Accuracy improvement of facial expression recognition in speech acts: Confirmation of effectiveness of information around a mouth and GAN-based data augmentation
URI https://ieeexplore.ieee.org/document/8956436