Facing Imbalanced Data--Recommendations for the Use of Performance Metrics

Bibliographic Details
Published in International Conference on Affective Computing and Intelligent Interaction and workshops Vol. 2013; pp. 245-251
Main Authors Jeni, Laszlo A.; Cohn, Jeffrey F.; De La Torre, Fernando
Format Conference Proceeding, Journal Article
Language English
Published United States IEEE 01.01.2013
Abstract Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features and classifiers. The effect of imbalanced data on action unit detection has been relatively neglected. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experiments using both simulated classifiers and three major databases that differ in size, type of FACS coding, and degree of skew. We evaluated the influence of skew on both threshold metrics (Accuracy, F-score, Cohen's kappa, and Krippendorff's alpha) and rank metrics (area under the receiver operating characteristic (ROC) curve and area under the precision-recall curve). With the exception of area under the ROC curve, all were attenuated by skewed distributions, in many cases dramatically so. While ROC was unaffected by skew, precision-recall curves suggest that ROC may mask poor performance. Our findings suggest that skew is a critical factor in evaluating performance metrics. To avoid or minimize skew-biased estimates of performance, we recommend reporting skew-normalized scores along with the obtained ones.
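Illustration (not from the paper) A minimal sketch of the effect the abstract describes, under stated assumptions: a hypothetical classifier with a fixed true-positive rate of 0.8 and false-positive rate of 0.1 is scored on data with increasing skew, where skew is taken to mean the ratio of negative to positive examples. The function names are illustrative, and skew_normalized shows only one plausible reading of "skew-normalized scores" (rescaling the negative-class counts so that both classes carry equal weight); the authors' exact definition is in the full text.

```python
def confusion_counts(tpr, fpr, n_pos, n_neg):
    """Expected confusion-matrix counts for a classifier with fixed TPR and FPR."""
    tp = tpr * n_pos
    fn = n_pos - tp
    fp = fpr * n_neg
    tn = n_neg - fp
    return tp, fp, fn, tn


def f1_score(tp, fp, fn, tn):
    """F1 = 2TP / (2TP + FP + FN); TN is accepted but unused."""
    return 2 * tp / (2 * tp + fp + fn)


def cohens_kappa(tp, fp, fn, tn):
    """Cohen's kappa between predicted and true binary labels."""
    n = tp + fp + fn + tn
    p_observed = (tp + tn) / n
    p_chance = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    return (p_observed - p_chance) / (1 - p_chance)


def skew_normalized(metric, tp, fp, fn, tn):
    """Assumed normalization: divide negative-class counts by the skew
    (negatives / positives) so the classes are balanced, then apply the metric."""
    skew = (fp + tn) / (tp + fn)
    return metric(tp, fp / skew, fn, tn / skew)


# Same classifier (TPR and FPR unchanged), increasingly imbalanced data.
for skew in (1, 10, 50):
    counts = confusion_counts(0.8, 0.1, n_pos=1000, n_neg=1000 * skew)
    print(
        f"skew={skew:>2}  "
        f"F1={f1_score(*counts):.3f}  "
        f"kappa={cohens_kappa(*counts):.3f}  "
        f"F1 (skew-normalized)={skew_normalized(f1_score, *counts):.3f}"
    )
```

Because TPR and FPR are held constant, the classifier's ROC operating point does not move, yet the raw F1 and kappa fall as negatives come to outnumber positives; the normalized value stays at its balanced-data level, which is the motivation for reporting both.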
Author Cohn, Jeffrey F.
De La Torre, Fernando
Jeni, Laszlo A.
AuthorAffiliation 1 Carnegie Mellon University, Pittsburgh, PA
2 University of Pittsburgh, Pittsburgh, PA, jeffcohn@cs.cmu.edu
Author_xml – sequence: 1
  givenname: Laszlo A.
  surname: Jeni
  fullname: Jeni, Laszlo A.
  email: laszlo.jeni@ieee.org
  organization: Carnegie Mellon Univ., Pittsburgh, PA, USA
– sequence: 2
  givenname: Jeffrey F.
  surname: Cohn
  fullname: Cohn, Jeffrey F.
  email: jeffcohn@cs.cmu.edu
  organization: Carnegie Mellon Univ., Pittsburgh, PA, USA
– sequence: 3
  givenname: Fernando
  surname: De La Torre
  fullname: De La Torre, Fernando
  email: ftorre@cs.cmu.edu
  organization: Carnegie Mellon Univ., Pittsburgh, PA, USA
BackLink https://www.ncbi.nlm.nih.gov/pubmed/25574450 (View this record in MEDLINE/PubMed)
CODEN IEEPAD
ContentType Conference Proceeding
Journal Article
DOI 10.1109/ACII.2013.47
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
PubMed
MEDLINE - Academic
PubMed Central (Full Participant titles)
Discipline Computer Science
EISBN 9780769550480
0769550487
EISSN 2156-8111
EndPage 251
ExternalDocumentID PMC4285355
25574450
6681438
Genre orig-research
Journal Article
GrantInformation_xml – fundername: NIMH NIH HHS
  grantid: R01 MH096951
ISSN 2156-8103
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
Notes laszlo.jeni@ieee.org, ftorre@cs.cmu.edu
OpenAccessLink http://doi.org/10.1109/ACII.2013.47
PMID 25574450
PublicationDate 2013-01-01
PublicationDateYYYYMMDD 2013-01-01
PublicationDate_xml – month: 01
  year: 2013
  text: 2013-01-01
  day: 01
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle International Conference on Affective Computing and Intelligent Interaction and workshops
PublicationTitleAbbrev acii
PublicationTitleAlternate Int Conf Affect Comput Intell Interact Workshops
PublicationYear 2013
Publisher IEEE
Publisher_xml – name: IEEE
StartPage 245
SubjectTerms Accuracy
action unit detection
Gold
imbalanced data
Measurement
Pain
performance metrics
Shape
skew normalization
Three-dimensional displays
Title Facing Imbalanced Data--Recommendations for the Use of Performance Metrics
URI https://ieeexplore.ieee.org/document/6681438
https://www.ncbi.nlm.nih.gov/pubmed/25574450
https://www.proquest.com/docview/1835627117
https://pubmed.ncbi.nlm.nih.gov/PMC4285355
Volume 2013