DeepFace: Closing the Gap to Human-Level Performance in Face Verification

In modern face recognition, the conventional pipeline consists of four stages: detect => align => represent => classify. We revisit both the alignment step and the representation step by employing explicit 3D face modeling in order to apply a piecewise affine transformation, and derive a fa...

Full description

Saved in:
Bibliographic Details
Published in2014 IEEE Conference on Computer Vision and Pattern Recognition pp. 1701 - 1708
Main Authors Taigman, Yaniv, Ming Yang, Ranzato, Marc'Aurelio, Wolf, Lior
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.06.2014
Subjects
Online AccessGet full text

Cover

Loading…
Abstract In modern face recognition, the conventional pipeline consists of four stages: detect => align => represent => classify. We revisit both the alignment step and the representation step by employing explicit 3D face modeling in order to apply a piecewise affine transformation, and derive a face representation from a nine-layer deep neural network. This deep network involves more than 120 million parameters using several locally connected layers without weight sharing, rather than the standard convolutional layers. Thus we trained it on the largest facial dataset to-date, an identity labeled dataset of four million facial images belonging to more than 4, 000 identities. The learned representations coupling the accurate model-based alignment with the large facial database generalize remarkably well to faces in unconstrained environments, even with a simple classifier. Our method reaches an accuracy of 97.35% on the Labeled Faces in the Wild (LFW) dataset, reducing the error of the current state of the art by more than 27%, closely approaching human-level performance.
AbstractList In modern face recognition, the conventional pipeline consists of four stages: detect => align => represent => classify. We revisit both the alignment step and the representation step by employing explicit 3D face modeling in order to apply a piecewise affine transformation, and derive a face representation from a nine-layer deep neural network. This deep network involves more than 120 million parameters using several locally connected layers without weight sharing, rather than the standard convolutional layers. Thus we trained it on the largest facial dataset to-date, an identity labeled dataset of four million facial images belonging to more than 4, 000 identities. The learned representations coupling the accurate model-based alignment with the large facial database generalize remarkably well to faces in unconstrained environments, even with a simple classifier. Our method reaches an accuracy of 97.35% on the Labeled Faces in the Wild (LFW) dataset, reducing the error of the current state of the art by more than 27%, closely approaching human-level performance.
Author Ranzato, Marc'Aurelio
Taigman, Yaniv
Ming Yang
Wolf, Lior
Author_xml – sequence: 1
  givenname: Yaniv
  surname: Taigman
  fullname: Taigman, Yaniv
  organization: Facebook AI Res., Menlo Park, CA, USA
– sequence: 2
  surname: Ming Yang
  fullname: Ming Yang
  email: mingyang@fb.com
  organization: Facebook AI Res., Menlo Park, CA, USA
– sequence: 3
  givenname: Marc'Aurelio
  surname: Ranzato
  fullname: Ranzato, Marc'Aurelio
  email: ranzato@fb.com
  organization: Facebook AI Res., Menlo Park, CA, USA
– sequence: 4
  givenname: Lior
  surname: Wolf
  fullname: Wolf, Lior
  email: wolf@cs.tau.ac.il
  organization: Tel Aviv Univ., Tel Aviv, Israel
BookMark eNotj81KxDAYRSOM4Dh26cpNXqA1X34bd1KdHyg4iM52SNMvGui0pa2Cbz8VXV3u4lzOvSaLtmuRkFtgGQCz98Vh_5pxBjLjnF2QxJocpLFWAeRqQZbAtEi1BXtFknGMFePaaKmEXpLdE2K_dh4faNF0Y2w_6PSJdON6OnV0-3VybVriNzZ0j0Pohrl7pLGlvww94BBD9G6KXXtDLoNrRkz-c0Xe189vxTYtXza74rFMIzdySuscZzevpDRMOc2D4yoPVmKFCo0XITgjAbXAaj4hbM1rzPMqMABEz4NYkbu_3YiIx36IJzf8HLVlVoMWZxduTgI
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/CVPR.2014.220
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library Online
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library Online
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Applied Sciences
Agriculture
Computer Science
EISBN 9781479951185
1479951188
EndPage 1708
ExternalDocumentID 6909616
Genre orig-research
GroupedDBID 23M
29F
29O
6IE
6IH
6IK
ACGFS
ALMA_UNASSIGNED_HOLDINGS
CBEJK
G8K
IPLJI
JC5
M43
RIE
RIG
RIO
RNS
ID FETCH-LOGICAL-i274t-d8e147c544705a62fa258f94ebe5e7c3ffa741e63eb81439d2de88bf011eec2f3
IEDL.DBID RIE
ISSN 1063-6919
IngestDate Wed Jun 26 19:23:54 EDT 2024
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i274t-d8e147c544705a62fa258f94ebe5e7c3ffa741e63eb81439d2de88bf011eec2f3
PageCount 8
ParticipantIDs ieee_primary_6909616
PublicationCentury 2000
PublicationDate 20140601
PublicationDateYYYYMMDD 2014-06-01
PublicationDate_xml – month: 06
  year: 2014
  text: 20140601
  day: 01
PublicationDecade 2010
PublicationTitle 2014 IEEE Conference on Computer Vision and Pattern Recognition
PublicationTitleAbbrev CVPR
PublicationYear 2014
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssib026764536
ssj0023720
Score 2.576663
Snippet In modern face recognition, the conventional pipeline consists of four stages: detect => align => represent => classify. We revisit both the alignment step and...
SourceID ieee
SourceType Publisher
StartPage 1701
SubjectTerms Agriculture
Face
Face recognition
Shape
Solid modeling
Three-dimensional displays
Training
Title DeepFace: Closing the Gap to Human-Level Performance in Face Verification
URI https://ieeexplore.ieee.org/document/6909616
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELZKJ1gKbRFveWDEaeLYTsyGCqUgiipEq26Vk5xRBUqqki78euw8WoQY2KyTLFnns313vu8-hC4T4VMOPLCdjyPCIuaSyANJBFWeq0LGeWDByaNnMZywxxmfNdDVBgsDAEXxGTh2WPzlJ1m8tqmynonkpPDEDtoJXVpitWrboSIQjFvu7irYsuwrxU-n8ImQntz21-z1p-MXW9TFHGppvn-wqhSPyqCFRvVyylqSd2edR0789atT43_Xu4-6W_geHm8epgPUgLSN9m7eVlWfDWijVuV94upsfxpRTfBQyzro4RZgOVAxXOP-R2azCtj4i_heLXGe4SL_T55s1REeb_EHeJFiOwdPjXHrKiXYRZPB3Wt_SCruBbIwcWpOkhA8FsScscDlSlCtKA-1ZGbPOQSxr7UyvggIH6LQuFwyoQmEYaTNdQEQU-0fomaapXCEsLl0E534RhFMsph5ypx5QSPtJ6Ak6PgYdazq5suyvca80trJ3-JTtGu3rqzWOkPNfLWGc-MX5NFFYRDfOPi0Yw
link.rule.ids 310,311,786,790,795,796,802,27958,55109
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LTwIxEJ4gHtQLChjf9uDRBbbbdrfeDIqgQIgBwo10d6eGaIDgcvHX2y4LGOPBWzNJk2Y67bznA7iJhUc5ct9OPg4dFrKaE7ooHUGVW1MB49y3zcmdrmgO2POIj3Jwu-mFQcS0-Awrdpnm8uNZtLShsqrx5KRwxQ7sGj1fk6turbX0UOELxi16d-ZuWfyVNNcpPEdIV24nbFbrw96rLetiFWqBvn_gqqRqpVGAzvpAq2qS98oyCSvR169Zjf898SGUtw18pLdRTUeQw2kRDu7fFtmkDSxCIbM_Sfa6Pw1pDfGwppWg9YA4b6gI70j9Y2bjCsRYjORJzUkyI2kGwGnbuiPS23YgkMmU2D1kaMRbZ0HBMgwaj_1608nQF5yJ8VQTJw7QZX7EGfNrXAmqFeWBlszcOkc_8rRWxhpB4WEYGKNLxjTGIAi1-TAQI6q9Y8hPZ1M8AWK-3VjHnmEEkyxirjKvXtBQezEqiTo6hZJl3Xi-GrAxzrh29jf5Gvaa_U573G51X85h317jqnbrAvLJYomXxkpIwqtUOL4B7nS3uQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2014+IEEE+Conference+on+Computer+Vision+and+Pattern+Recognition&rft.atitle=DeepFace%3A+Closing+the+Gap+to+Human-Level+Performance+in+Face+Verification&rft.au=Taigman%2C+Yaniv&rft.au=Ming+Yang&rft.au=Ranzato%2C+Marc%27Aurelio&rft.au=Wolf%2C+Lior&rft.date=2014-06-01&rft.pub=IEEE&rft.issn=1063-6919&rft.spage=1701&rft.epage=1708&rft_id=info:doi/10.1109%2FCVPR.2014.220&rft.externalDocID=6909616
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1063-6919&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1063-6919&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1063-6919&client=summon