Facing Imbalanced Data--Recommendations for the Use of Performance Metrics
Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features classifiers. Relatively neglected is the effect of imbalanced data for action unit detection. While the mac...
Saved in:
Published in | International Conference on Affective Computing and Intelligent Interaction and workshops Vol. 2013; pp. 245 - 251 |
---|---|
Main Authors | , , |
Format | Conference Proceeding Journal Article |
Language | English |
Published |
United States
IEEE
01.01.2013
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features classifiers. Relatively neglected is the effect of imbalanced data for action unit detection. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experiments using both simulated classifiers and three major databases that differ in size, type of FACS coding, and degree of skew. We evaluated influence of skew on both threshold metrics (Accuracy, F-score, Cohen's kappa, and Krippendorf's alpha) and rank metrics (area under the receiver operating characteristic (ROC) curve and precision-recall curve). With exception of area under the ROC curve, all were attenuated by skewed distributions, in many cases, dramatically so. While ROC was unaffected by skew, precision-recall curves suggest that ROC may mask poor performance. Our findings suggest that skew is a critical factor in evaluating performance metrics. To avoid or minimize skew-biased estimates of performance, we recommend reporting skew-normalized scores along with the obtained ones. |
---|---|
AbstractList | Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features classifiers. Relatively neglected is the effect of imbalanced data for action unit detection. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experiments using both simulated classifiers and three major databases that differ in size, type of FACS coding, and degree of skew. We evaluated influence of skew on both threshold metrics (Accuracy, F-score, Cohen's kappa, and Krippendorf's alpha) and rank metrics (area under the receiver operating characteristic (ROC) curve and precision-recall curve). With exception of area under the ROC curve, all were attenuated by skewed distributions, in many cases, dramatically so. While ROC was unaffected by skew, precision-recall curves suggest that ROC may mask poor performance. Our findings suggest that skew is a critical factor in evaluating performance metrics. To avoid or minimize skew-biased estimates of performance, we recommend reporting skew-normalized scores along with the obtained ones. Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features classifiers. Relatively neglected is the effect of imbalanced data for action unit detection. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experiments using both simulated classifiers and three major databases that differ in size, type of FACS coding, and degree of skew. We evaluated influence of skew on both threshold metrics (Accuracy, F-score, Cohen's kappa, and Krippendorf's alpha) and rank metrics (area under the receiver operating characteristic (ROC) curve and precision-recall curve). With exception of area under the ROC curve, all were attenuated by skewed distributions, in many cases, dramatically so. While ROC was unaffected by skew, precision-recall curves suggest that ROC may mask poor performance. Our findings suggest that skew is a critical factor in evaluating performance metrics. To avoid or minimize skew-biased estimates of performance, we recommend reporting skew-normalized scores along with the obtained ones.Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features classifiers. Relatively neglected is the effect of imbalanced data for action unit detection. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experiments using both simulated classifiers and three major databases that differ in size, type of FACS coding, and degree of skew. We evaluated influence of skew on both threshold metrics (Accuracy, F-score, Cohen's kappa, and Krippendorf's alpha) and rank metrics (area under the receiver operating characteristic (ROC) curve and precision-recall curve). With exception of area under the ROC curve, all were attenuated by skewed distributions, in many cases, dramatically so. While ROC was unaffected by skew, precision-recall curves suggest that ROC may mask poor performance. Our findings suggest that skew is a critical factor in evaluating performance metrics. To avoid or minimize skew-biased estimates of performance, we recommend reporting skew-normalized scores along with the obtained ones. |
Author | Cohn, Jeffrey F. De La Torre, Fernando Jeni, Laszlo A. |
AuthorAffiliation | 1 Carnegie Mellon University, Pittsburgh, PA 2 University of Pittsburgh, Pittsburgh, PA, jeffcohn@cs.cmu.edu |
AuthorAffiliation_xml | – name: 2 University of Pittsburgh, Pittsburgh, PA, jeffcohn@cs.cmu.edu – name: 1 Carnegie Mellon University, Pittsburgh, PA |
Author_xml | – sequence: 1 givenname: Laszlo A. surname: Jeni fullname: Jeni, Laszlo A. email: laszlo.jeni@ieee.org organization: Carnegie Mellon Univ., Pittsburgh, PA, USA – sequence: 2 givenname: Jeffrey F. surname: Cohn fullname: Cohn, Jeffrey F. email: jeffcohn@cs.cmu.edu organization: Carnegie Mellon Univ., Pittsburgh, PA, USA – sequence: 3 givenname: Fernando surname: De La Torre fullname: De La Torre, Fernando email: ftorre@cs.cmu.edu organization: Carnegie Mellon Univ., Pittsburgh, PA, USA |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/25574450$$D View this record in MEDLINE/PubMed |
BookMark | eNpVkM1Lw0AQxVep2Fp78yZIjl5S9zO7exFKtVqpKGLPYbOZtAtJtmZTwf_eSGvR0wzzfvMeM2eoV_saELogeEwI1jeT6Xw-ppiwMZdHaKSlwjLRQmCu8DEaUCKSWBFCeocesz4aheAyTBOZME7pKepTISTnAg_Q08xYV6-ieZWZ0tQW8ujOtCaO38D6qoI6N63zdYgK30TtGqJlgMgX0Ss03aT62YieoW2cDefopDBlgNG-DtFydv8-fYwXLw_z6WQRO6ZxG0smcYYxkZbqgogu0SiRC2ZsRjmBQmmLsySngnEOWnPLM0MtMEUyanVh2BDd7nw326yC3ELdNqZMN42rTPOVeuPS_0rt1unKf6acKsGE6Ayu9waN_9hCaNPKBQtldz_4bUiJYiKhkhDZoVd_sw4hvw_sgMsd4ADgICeJIpwp9g3Uk4CZ |
CODEN | IEEPAD |
ContentType | Conference Proceeding Journal Article |
DBID | 6IE 6IL CBEJK RIE RIL NPM 7X8 5PM |
DOI | 10.1109/ACII.2013.47 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present PubMed MEDLINE - Academic PubMed Central (Full Participant titles) |
DatabaseTitle | PubMed MEDLINE - Academic |
DatabaseTitleList | MEDLINE - Academic PubMed |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISBN | 9780769550480 0769550487 |
EISSN | 2156-8111 |
EndPage | 251 |
ExternalDocumentID | PMC4285355 25574450 6681438 |
Genre | orig-research Journal Article |
GrantInformation_xml | – fundername: NIMH NIH HHS grantid: R01 MH096951 |
GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI M43 OCL RIE RIL NPM 7X8 5PM |
ID | FETCH-LOGICAL-i390t-7370b0017c29f15ceda85d53acb241ef89c0b6d25344e994c4ba2ce381b2c9fa3 |
IEDL.DBID | RIE |
ISSN | 2156-8103 |
IngestDate | Thu Aug 21 13:33:58 EDT 2025 Fri Jul 11 07:20:32 EDT 2025 Thu Jan 02 22:31:08 EST 2025 Wed Aug 27 04:02:57 EDT 2025 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i390t-7370b0017c29f15ceda85d53acb241ef89c0b6d25344e994c4ba2ce381b2c9fa3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 laszlo.jeni@ieee.org, ftorre@cs.cmu.edu |
OpenAccessLink | http://doi.org/10.1109/ACII.2013.47 |
PMID | 25574450 |
PQID | 1835627117 |
PQPubID | 23479 |
PageCount | 7 |
ParticipantIDs | proquest_miscellaneous_1835627117 ieee_primary_6681438 pubmed_primary_25574450 pubmedcentral_primary_oai_pubmedcentral_nih_gov_4285355 |
PublicationCentury | 2000 |
PublicationDate | 2013-01-01 |
PublicationDateYYYYMMDD | 2013-01-01 |
PublicationDate_xml | – month: 01 year: 2013 text: 2013-01-01 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States |
PublicationTitle | International Conference on Affective Computing and Intelligent Interaction and workshops |
PublicationTitleAbbrev | acii |
PublicationTitleAlternate | Int Conf Affect Comput Intell Interact Workshops |
PublicationYear | 2013 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssib026763422 ssj0001950885 |
Score | 2.1234884 |
Snippet | Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and... |
SourceID | pubmedcentral proquest pubmed ieee |
SourceType | Open Access Repository Aggregation Database Index Database Publisher |
StartPage | 245 |
SubjectTerms | Accuracy action unit detection Gold imbalanced data Measurement Pain performance metrics Shape skew normalization Three-dimensional displays |
Title | Facing Imbalanced Data--Recommendations for the Use of Performance Metrics |
URI | https://ieeexplore.ieee.org/document/6681438 https://www.ncbi.nlm.nih.gov/pubmed/25574450 https://www.proquest.com/docview/1835627117 https://pubmed.ncbi.nlm.nih.gov/PMC4285355 |
Volume | 2013 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3JTsMwEB2VnjixFSibjMSRlNSxY_uIgIoigXqgErfKWwRCpIimF76esdOGRT1wixTHcbxk3oyf5wGcUYFWJHUysZaHaJXTiQn6JtIZL7VUnOpIkH3Ib8fs7ok_teC8OQvjvY_kM98Ll3Ev303tPITKLvJcBrXuNVhDx60-q7WcOzTHhcIWqehifCXKmwYGIxq1PJH9NGt47-ri8mo4DLyurNfoqqyCmH-Zkj9Mz2AD7peNrhknr715ZXr2808-x_9-1SZ0vg_5kVFjvrag5ctt2FiqPJDFot-Bu4G2WIAM30ygQVrvyLWudJIEz_UN665VmWYE4S9BOEnGM0-mBRl9H0kg90G3y846MB7cPF7dJgsFhuQlU2mViEyk0U20VBV9jm_QkjueaWvQ8vtCKpua3FGeMeaVYpYZTa1HFGCoVYXOdqFdTku_D0RRx5kVwmTMYdkAzBz-UArjgsenZBd2QtdM3uskG5NFr3ThdDkqE5z4YTdDl346n02wBsRuot8XXdirR6l5GP0kwRhPuyB-jV9TICTV_n2nfHmOybXRHeOIwQ5WN-cQ1mnUwwgxmCNoVx9zf4yopDIncTp-AQm734k |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT9swFH4CdoATYxRWtoEncSQldezYPk6MqmW06qGVuEX-FYEm0mlNL_z1e3baMBAHbpHiOI5_5H3v-fP7AM6pQCuSOplYy0O0yunEBH0T6YyXWipOdSTITvLhnN3c8bstuGjPwnjvI_nM98Jl3Mt3C7sKobLLPJdBrXsbPqDd57Q5rbWZPTTHpcLWyehihCUKnAYOI5q1PJH9NGuZ7-ryx9VoFJhdWa9VVnkLZL7mSv5nfAb7MN40u-Gc_O6tatOzT68yOr73uz5C5_mYH5m2BuwAtnz1CfY3Og9kvewP4WagLRYgo0cTiJDWO_JT1zpJgu_6iHU3ukxLggCYIKAk86Uni5JMnw8lkHFQ7rLLDswH17OrYbLWYEgeMpXWichEGh1FS1XZ5_gGLbnjmbYGbb8vpbKpyR3lGWNeKWaZ0dR6xAGGWlXq7Ah2qkXlPwNR1HFmhTAZc1g2QDOHv5TSuODzKdmFw9A1xZ8mzUax7pUufN-MSoFTP-xn6MovVssCa0D0Jvp90YXjZpTah9FTEozxtAvixfi1BUJa7Zd3qof7mF4bHTKOKOzk7eacwe5wNr4tbkeTX19gj0Z1jBCR-Qo79d-V_4YYpTancWr-A9vZ4tM |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=International+Conference+on+Affective+Computing+and+Intelligent+Interaction+and+workshops&rft.atitle=Facing+Imbalanced+Data--Recommendations+for+the+Use+of+Performance+Metrics&rft.au=Jeni%2C+Laszlo+A.&rft.au=Cohn%2C+Jeffrey+F.&rft.au=De+La+Torre%2C+Fernando&rft.date=2013-01-01&rft.pub=IEEE&rft.issn=2156-8103&rft.spage=245&rft.epage=251&rft_id=info:doi/10.1109%2FACII.2013.47&rft.externalDocID=6681438 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2156-8103&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2156-8103&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2156-8103&client=summon |