A multi-modal feature fusion framework for kinect-based facial expression recognition using Dual Kernel Discriminant Analysis (DKDA)

We present a multi-modal feature fusion framework for Kinect-based Facial Expression Recognition (FER). The framework extracts and pre-processes 2D and 3D features separately. The types of 2D and 3D features are selected to maximize the accuracy of the system, with the Histogram of Oriented Gradient...

Full description

Saved in:
Bibliographic Details
Published in2016 IEEE Winter Conference on Applications of Computer Vision (WACV) pp. 1 - 10
Main Authors Aly, Sherin, Abbott, A. Lynn, Torki, Marwan
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.03.2016
Subjects
Online AccessGet full text
DOI10.1109/WACV.2016.7477577

Cover

Abstract We present a multi-modal feature fusion framework for Kinect-based Facial Expression Recognition (FER). The framework extracts and pre-processes 2D and 3D features separately. The types of 2D and 3D features are selected to maximize the accuracy of the system, with the Histogram of Oriented Gradient (HOG) features for 2D data and statistically selected angles for 3D data giving the best performance. The sets of 2D features and 3D features are reduced and later combined using a novel Dual Kernel Discriminant Analysis (DKDA) approach. Final classification is done using SVMs. The framework is benchmarked on a public Kinect-based FER dataset which includes data for 32 subjects (in both frontal and non-frontal poses and two expression intensities) and 6 basic expressions (plus neutral), namely: happiness, sadness, anger, disgust, fear, and surprise. The framework shows that the proposed combination of 2D and 3D features outperforms simpler existing combinations of 2D and 3D features, as well as systems that use either 2D or 3D features only. The proposed system also outperforms Linear Discriminant Analysis (LDA)-transformed and traditional Kernel Discriminant Analysis (KDA)-transformed systems, with an average accuracy improving of 10%. It also outperforms the state of the art by more than 13% in frontal poses.
AbstractList We present a multi-modal feature fusion framework for Kinect-based Facial Expression Recognition (FER). The framework extracts and pre-processes 2D and 3D features separately. The types of 2D and 3D features are selected to maximize the accuracy of the system, with the Histogram of Oriented Gradient (HOG) features for 2D data and statistically selected angles for 3D data giving the best performance. The sets of 2D features and 3D features are reduced and later combined using a novel Dual Kernel Discriminant Analysis (DKDA) approach. Final classification is done using SVMs. The framework is benchmarked on a public Kinect-based FER dataset which includes data for 32 subjects (in both frontal and non-frontal poses and two expression intensities) and 6 basic expressions (plus neutral), namely: happiness, sadness, anger, disgust, fear, and surprise. The framework shows that the proposed combination of 2D and 3D features outperforms simpler existing combinations of 2D and 3D features, as well as systems that use either 2D or 3D features only. The proposed system also outperforms Linear Discriminant Analysis (LDA)-transformed and traditional Kernel Discriminant Analysis (KDA)-transformed systems, with an average accuracy improving of 10%. It also outperforms the state of the art by more than 13% in frontal poses.
Author Aly, Sherin
Torki, Marwan
Abbott, A. Lynn
Author_xml – sequence: 1
  givenname: Sherin
  surname: Aly
  fullname: Aly, Sherin
  email: sherin@vt.edu
  organization: Bradley Dept. of Electr. & Comput. Eng, Virginia Tech, Blacksburg, VA, USA
– sequence: 2
  givenname: A. Lynn
  surname: Abbott
  fullname: Abbott, A. Lynn
  email: abbott@vt.edu
  organization: Bradley Dept. of Electr. & Comput. Eng, Virginia Tech, Blacksburg, VA, USA
– sequence: 3
  givenname: Marwan
  surname: Torki
  fullname: Torki, Marwan
  email: mtorki@alexu.edu.eg
  organization: Dept. of Comput. Eng., Alexandria Univ., Alexandria, Egypt
BookMark eNotkD1PwzAYhI0EErT0ByAWjzCkvI7d2B6jhi-1EgsfY-UkryuriVPZiaA7P5wAne6Ge06nm5BT33kk5IrBnDHQdx_58n2eAsvmUki5kPKETNgCNEAmmD4nsxhdCRwYgMr0BfnOaTs0vUvarjYNtWj6ISC1Q3SdpzaYFj-7sKO2C3TnPFZ9UpqINbWmciOAX_uA8S8csOq23vW_fsT9lhbDmFhh8NjQwsUquNZ543uae9Mcoov0plgV-e0lObOmiTg76pS8Pdy_Lp-S9cvj8zJfJy4F1SdWZyUXC42IHDAtRcp5Bai4FowzyUxpwIgylaBSqVSZcqytVbWS1tYoNJ-S6_9eN1Zs9uMcEw6b41H8B0avYpM
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/WACV.2016.7477577
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1509006419
9781509006410
EndPage 10
ExternalDocumentID 7477577
Genre orig-research
GroupedDBID 6IE
6IL
ALMA_UNASSIGNED_HOLDINGS
CBEJK
RIB
RIC
RIE
RIL
ID FETCH-LOGICAL-i208t-f96b3459eee30e2b4233c0e839413171aba0a4b27082788b23edff8d87ffde493
IEDL.DBID RIE
IngestDate Wed Dec 20 05:18:49 EST 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i208t-f96b3459eee30e2b4233c0e839413171aba0a4b27082788b23edff8d87ffde493
PageCount 10
ParticipantIDs ieee_primary_7477577
PublicationCentury 2000
PublicationDate 20160301
PublicationDateYYYYMMDD 2016-03-01
PublicationDate_xml – month: 03
  year: 2016
  text: 20160301
  day: 01
PublicationDecade 2010
PublicationTitle 2016 IEEE Winter Conference on Applications of Computer Vision (WACV)
PublicationTitleAbbrev WACV
PublicationYear 2016
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssib030100869
Score 1.6799953
Snippet We present a multi-modal feature fusion framework for Kinect-based Facial Expression Recognition (FER). The framework extracts and pre-processes 2D and 3D...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Face
Face recognition
Feature extraction
Histograms
Kernel
Sensors
Three-dimensional displays
Title A multi-modal feature fusion framework for kinect-based facial expression recognition using Dual Kernel Discriminant Analysis (DKDA)
URI https://ieeexplore.ieee.org/document/7477577
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LTwIxEG6Akyc1YHynBw-aWNjtdne7RwISIsF4EOVG2u3UEHAxZPfi2R_udBcwGg_emqZNX5PON9NvpoRcac59FZmUgVSciUgKpqUQDI0NK2So4iR10cjjh2g4EffTcFojt7tYGAAoyWfQdsXyLd-s0sK5yjoIfeMwjuukjmJWxWptZQfl1KHzZPNw6XtJ56Xbe3bcrai96ffjA5VSfwz2yXg7ckUbWbSLXLfTj19JGf87tQPS-o7Uo487HXRIapA1yWeXljxB9rYyakktlMk7qS2cZ4zaLR2LIl6lC0SZac6cMjPUKudApzh-xY7N6I5fhGVHkX-l_QJbjGCdwZL25-7Sqcg0dJvehF73R_3uTYtMBndPvSHb_LbA5tyTObNJpAMRJri0wAOuEWcFqQcIoFDP-bGvtPKU0DxG0IB2s-YBGGulkbG1BkQSHJFGtsrgmFADRusE2yi0efEGk2mYeJHkyvqA9g0_IU23g7P3KqHGbLN5p39Xn5E9d4oV8eucNPJ1AReIBHJ9WYrAF1pmthE
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwELX4GGACVBDfeGAACbeJ4yTOWFFQoS1iaKFbZcdnVLWkCCULMz-cc9IWgRjYrMiRE_vk9-787kzIuebcV5FJGUjFmYikYFoKwdDZsEKGKk5Sl43ce4jaA3E_DIcr5GqZCwMApfgM6q5ZnuWbWVq4UFkDqW8cxvEqWUfcF2GVrbWwHrRUx8-T-dGl7yWN5-b1k1NvRfX5mz-uUCkR5HaL9BZjV8KRSb3IdT39-FWW8b8ft012v3P16OMShXbICmQ18tmkpVKQvc6MmlILZflOagsXG6N2IciiyFjpBHlmmjMHZ4Za5ULoFMev9LEZXSqMsO1E8i-0VWCPDrxnMKWtsdt2KjkNXRQ4oRetTqt5uUsGtzf96zab37fAxtyTObNJpAMRJvhrgQdcI9MKUg-QQiHS-bGvtPKU0DxG2oCes-YBGGulkbG1BkQS7JG1bJbBPqEGjNYJ9lHo9eIeJtMw8SLJlfUBPRx-QGpuBkdvVUmN0XzyDv9-fEY22v1ed9S9e-gckU23opUM7Jis5e8FnCAvyPVpaQ5fDGC5Xg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2016+IEEE+Winter+Conference+on+Applications+of+Computer+Vision+%28WACV%29&rft.atitle=A+multi-modal+feature+fusion+framework+for+kinect-based+facial+expression+recognition+using+Dual+Kernel+Discriminant+Analysis+%28DKDA%29&rft.au=Aly%2C+Sherin&rft.au=Abbott%2C+A.+Lynn&rft.au=Torki%2C+Marwan&rft.date=2016-03-01&rft.pub=IEEE&rft.spage=1&rft.epage=10&rft_id=info:doi/10.1109%2FWACV.2016.7477577&rft.externalDocID=7477577