A multi-modal feature fusion framework for kinect-based facial expression recognition using Dual Kernel Discriminant Analysis (DKDA)
We present a multi-modal feature fusion framework for Kinect-based Facial Expression Recognition (FER). The framework extracts and pre-processes 2D and 3D features separately. The types of 2D and 3D features are selected to maximize the accuracy of the system, with the Histogram of Oriented Gradient...
Saved in:
Published in | 2016 IEEE Winter Conference on Applications of Computer Vision (WACV) pp. 1 - 10 |
---|---|
Main Authors | , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.03.2016
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/WACV.2016.7477577 |
Cover
Abstract | We present a multi-modal feature fusion framework for Kinect-based Facial Expression Recognition (FER). The framework extracts and pre-processes 2D and 3D features separately. The types of 2D and 3D features are selected to maximize the accuracy of the system, with the Histogram of Oriented Gradient (HOG) features for 2D data and statistically selected angles for 3D data giving the best performance. The sets of 2D features and 3D features are reduced and later combined using a novel Dual Kernel Discriminant Analysis (DKDA) approach. Final classification is done using SVMs. The framework is benchmarked on a public Kinect-based FER dataset which includes data for 32 subjects (in both frontal and non-frontal poses and two expression intensities) and 6 basic expressions (plus neutral), namely: happiness, sadness, anger, disgust, fear, and surprise. The framework shows that the proposed combination of 2D and 3D features outperforms simpler existing combinations of 2D and 3D features, as well as systems that use either 2D or 3D features only. The proposed system also outperforms Linear Discriminant Analysis (LDA)-transformed and traditional Kernel Discriminant Analysis (KDA)-transformed systems, with an average accuracy improving of 10%. It also outperforms the state of the art by more than 13% in frontal poses. |
---|---|
AbstractList | We present a multi-modal feature fusion framework for Kinect-based Facial Expression Recognition (FER). The framework extracts and pre-processes 2D and 3D features separately. The types of 2D and 3D features are selected to maximize the accuracy of the system, with the Histogram of Oriented Gradient (HOG) features for 2D data and statistically selected angles for 3D data giving the best performance. The sets of 2D features and 3D features are reduced and later combined using a novel Dual Kernel Discriminant Analysis (DKDA) approach. Final classification is done using SVMs. The framework is benchmarked on a public Kinect-based FER dataset which includes data for 32 subjects (in both frontal and non-frontal poses and two expression intensities) and 6 basic expressions (plus neutral), namely: happiness, sadness, anger, disgust, fear, and surprise. The framework shows that the proposed combination of 2D and 3D features outperforms simpler existing combinations of 2D and 3D features, as well as systems that use either 2D or 3D features only. The proposed system also outperforms Linear Discriminant Analysis (LDA)-transformed and traditional Kernel Discriminant Analysis (KDA)-transformed systems, with an average accuracy improving of 10%. It also outperforms the state of the art by more than 13% in frontal poses. |
Author | Aly, Sherin Torki, Marwan Abbott, A. Lynn |
Author_xml | – sequence: 1 givenname: Sherin surname: Aly fullname: Aly, Sherin email: sherin@vt.edu organization: Bradley Dept. of Electr. & Comput. Eng, Virginia Tech, Blacksburg, VA, USA – sequence: 2 givenname: A. Lynn surname: Abbott fullname: Abbott, A. Lynn email: abbott@vt.edu organization: Bradley Dept. of Electr. & Comput. Eng, Virginia Tech, Blacksburg, VA, USA – sequence: 3 givenname: Marwan surname: Torki fullname: Torki, Marwan email: mtorki@alexu.edu.eg organization: Dept. of Comput. Eng., Alexandria Univ., Alexandria, Egypt |
BookMark | eNotkD1PwzAYhI0EErT0ByAWjzCkvI7d2B6jhi-1EgsfY-UkryuriVPZiaA7P5wAne6Ge06nm5BT33kk5IrBnDHQdx_58n2eAsvmUki5kPKETNgCNEAmmD4nsxhdCRwYgMr0BfnOaTs0vUvarjYNtWj6ISC1Q3SdpzaYFj-7sKO2C3TnPFZ9UpqINbWmciOAX_uA8S8csOq23vW_fsT9lhbDmFhh8NjQwsUquNZ543uae9Mcoov0plgV-e0lObOmiTg76pS8Pdy_Lp-S9cvj8zJfJy4F1SdWZyUXC42IHDAtRcp5Bai4FowzyUxpwIgylaBSqVSZcqytVbWS1tYoNJ-S6_9eN1Zs9uMcEw6b41H8B0avYpM |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/WACV.2016.7477577 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 1509006419 9781509006410 |
EndPage | 10 |
ExternalDocumentID | 7477577 |
Genre | orig-research |
GroupedDBID | 6IE 6IL ALMA_UNASSIGNED_HOLDINGS CBEJK RIB RIC RIE RIL |
ID | FETCH-LOGICAL-i208t-f96b3459eee30e2b4233c0e839413171aba0a4b27082788b23edff8d87ffde493 |
IEDL.DBID | RIE |
IngestDate | Wed Dec 20 05:18:49 EST 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i208t-f96b3459eee30e2b4233c0e839413171aba0a4b27082788b23edff8d87ffde493 |
PageCount | 10 |
ParticipantIDs | ieee_primary_7477577 |
PublicationCentury | 2000 |
PublicationDate | 20160301 |
PublicationDateYYYYMMDD | 2016-03-01 |
PublicationDate_xml | – month: 03 year: 2016 text: 20160301 day: 01 |
PublicationDecade | 2010 |
PublicationTitle | 2016 IEEE Winter Conference on Applications of Computer Vision (WACV) |
PublicationTitleAbbrev | WACV |
PublicationYear | 2016 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssib030100869 |
Score | 1.6799953 |
Snippet | We present a multi-modal feature fusion framework for Kinect-based Facial Expression Recognition (FER). The framework extracts and pre-processes 2D and 3D... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1 |
SubjectTerms | Face Face recognition Feature extraction Histograms Kernel Sensors Three-dimensional displays |
Title | A multi-modal feature fusion framework for kinect-based facial expression recognition using Dual Kernel Discriminant Analysis (DKDA) |
URI | https://ieeexplore.ieee.org/document/7477577 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LTwIxEG6Akyc1YHynBw-aWNjtdne7RwISIsF4EOVG2u3UEHAxZPfi2R_udBcwGg_emqZNX5PON9NvpoRcac59FZmUgVSciUgKpqUQDI0NK2So4iR10cjjh2g4EffTcFojt7tYGAAoyWfQdsXyLd-s0sK5yjoIfeMwjuukjmJWxWptZQfl1KHzZPNw6XtJ56Xbe3bcrai96ffjA5VSfwz2yXg7ckUbWbSLXLfTj19JGf87tQPS-o7Uo487HXRIapA1yWeXljxB9rYyakktlMk7qS2cZ4zaLR2LIl6lC0SZac6cMjPUKudApzh-xY7N6I5fhGVHkX-l_QJbjGCdwZL25-7Sqcg0dJvehF73R_3uTYtMBndPvSHb_LbA5tyTObNJpAMRJri0wAOuEWcFqQcIoFDP-bGvtPKU0DxG0IB2s-YBGGulkbG1BkQSHJFGtsrgmFADRusE2yi0efEGk2mYeJHkyvqA9g0_IU23g7P3KqHGbLN5p39Xn5E9d4oV8eucNPJ1AReIBHJ9WYrAF1pmthE |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwELX4GGACVBDfeGAACbeJ4yTOWFFQoS1iaKFbZcdnVLWkCCULMz-cc9IWgRjYrMiRE_vk9-787kzIuebcV5FJGUjFmYikYFoKwdDZsEKGKk5Sl43ce4jaA3E_DIcr5GqZCwMApfgM6q5ZnuWbWVq4UFkDqW8cxvEqWUfcF2GVrbWwHrRUx8-T-dGl7yWN5-b1k1NvRfX5mz-uUCkR5HaL9BZjV8KRSb3IdT39-FWW8b8ft012v3P16OMShXbICmQ18tmkpVKQvc6MmlILZflOagsXG6N2IciiyFjpBHlmmjMHZ4Za5ULoFMev9LEZXSqMsO1E8i-0VWCPDrxnMKWtsdt2KjkNXRQ4oRetTqt5uUsGtzf96zab37fAxtyTObNJpAMRJvhrgQdcI9MKUg-QQiHS-bGvtPKU0DxG2oCes-YBGGulkbG1BkQS7JG1bJbBPqEGjNYJ9lHo9eIeJtMw8SLJlfUBPRx-QGpuBkdvVUmN0XzyDv9-fEY22v1ed9S9e-gckU23opUM7Jis5e8FnCAvyPVpaQ5fDGC5Xg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2016+IEEE+Winter+Conference+on+Applications+of+Computer+Vision+%28WACV%29&rft.atitle=A+multi-modal+feature+fusion+framework+for+kinect-based+facial+expression+recognition+using+Dual+Kernel+Discriminant+Analysis+%28DKDA%29&rft.au=Aly%2C+Sherin&rft.au=Abbott%2C+A.+Lynn&rft.au=Torki%2C+Marwan&rft.date=2016-03-01&rft.pub=IEEE&rft.spage=1&rft.epage=10&rft_id=info:doi/10.1109%2FWACV.2016.7477577&rft.externalDocID=7477577 |