Mixed Effects Neural Networks (MeNets) With Applications to Gaze Estimation

Bibliographic Details
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online), pp. 7735-7744
Main Authors Xiong, Yunyang; Kim, Hyunwoo J.; Singh, Vikas
Format Conference Proceeding
Language English
Published IEEE 01.06.2019
Abstract There is much interest in computer vision in using commodity hardware for gaze estimation. A number of papers have shown that algorithms based on deep convolutional architectures are approaching accuracies at which streaming data from mass-market devices can offer good gaze tracking performance, although a gap remains between what is possible and the performance users will expect in real deployments. We observe that one obvious avenue for improvement relates to a mismatch between some basic technical assumptions behind most existing approaches and the statistical properties of the data used for training. Specifically, most training datasets involve tens of users with a few hundred (or more) repeated acquisitions per user. The non-i.i.d. nature of this data suggests that better estimation may be possible if the model explicitly made use of such "repeated measurements" from each user, as is commonly done in classical statistical analysis using so-called mixed effects models. The goal of this paper is to adapt these "mixed effects" ideas from statistics within a deep neural network architecture for gaze estimation based on eye images. Such a formulation seeks to exploit the hierarchical structure of the training data: each node in the hierarchy is a user who provides tens or hundreds of repeated samples. This modification yields an architecture that offers state-of-the-art performance on various publicly available datasets, improving results by 10-20%.
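The "repeated measurements" idea in the abstract can be illustrated with a classical random-intercept model, in which every user shares one fixed effect and additionally carries a user-specific offset. The sketch below is not the MeNets architecture from the paper; it is a minimal NumPy illustration on synthetic data, and every name in it (`simulate`, `fit_pooled`, `random_intercepts`, `demo`), the variance-component estimator, and all parameter values are assumptions made for this example only.

```python
import numpy as np


def simulate(n_users=10, n_per_user=50, slope=2.0, tau=1.0, sigma=0.5, seed=0):
    """Synthetic grouped data: y = slope * x + b[user] + noise (all values hypothetical)."""
    rng = np.random.default_rng(seed)
    user = np.repeat(np.arange(n_users), n_per_user)
    x = rng.normal(size=user.size)
    b = rng.normal(scale=tau, size=n_users)  # one random offset per user
    y = slope * x + b[user] + rng.normal(scale=sigma, size=user.size)
    return x, y, user


def fit_pooled(x, y):
    """Ordinary least squares that ignores the grouping (treats samples as i.i.d.)."""
    X = np.column_stack([x, np.ones_like(x)])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef[0], coef[1]  # slope, intercept


def random_intercepts(resid, user, n_users):
    """Shrunken per-user mean residuals (BLUP form for a random intercept).

    Variance components come from a simple moment estimate, which is an
    assumption of this sketch rather than a full mixed-model fit.
    """
    counts = np.bincount(user, minlength=n_users)
    means = np.bincount(user, weights=resid, minlength=n_users) / counts
    within = resid - means[user]
    sigma2 = np.sum(within ** 2) / (resid.size - n_users)  # within-user variance
    tau2 = max(means.var(ddof=1) - sigma2 / counts.mean(), 0.0)  # between-user variance
    shrink = counts * tau2 / (counts * tau2 + sigma2)
    return shrink * means


def demo(n_users=10, n_per_user=50, n_train=40):
    x, y, user = simulate(n_users=n_users, n_per_user=n_per_user)
    # Hold out the last samples of every user to evaluate on unseen data.
    pos = np.arange(user.size) % n_per_user
    train, test = pos < n_train, pos >= n_train

    slope, intercept = fit_pooled(x[train], y[train])
    resid = y[train] - (slope * x[train] + intercept)
    b_hat = random_intercepts(resid, user[train], n_users)

    pred_pooled = slope * x[test] + intercept
    pred_mixed = pred_pooled + b_hat[user[test]]  # add the user-specific offset
    mse_pooled = float(np.mean((y[test] - pred_pooled) ** 2))
    mse_mixed = float(np.mean((y[test] - pred_mixed) ** 2))
    return mse_pooled, mse_mixed


if __name__ == "__main__":
    mse_pooled, mse_mixed = demo()
    print(f"pooled MSE: {mse_pooled:.3f}   mixed MSE: {mse_mixed:.3f}")
```

On data generated this way, the pooled fit leaves each user's offset in its residuals, while the shrunken per-user intercepts absorb most of it. The paper applies the same principle to a deep network rather than to a linear model.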
Author Affiliations
Xiong, Yunyang (Univ. of Wisconsin-Madison)
Kim, Hyunwoo J. (Korea Univ)
Singh, Vikas (Univ. of Wisconsin-Madison)
DOI 10.1109/CVPR.2019.00793
Discipline Applied Sciences
EISBN 9781728132938
1728132932
EISSN 1063-6919
Genre orig-research
IsPeerReviewed false
IsScholarly true
PageCount 10
PublicationTitleAbbrev CVPR
Subject Terms Adaptation models
Analytical models
Artificial neural networks
Computational modeling
Computer architecture
Computer vision
Data models
Estimation
Face, Gesture, and Body Pose
Motion and Tracking
Performance evaluation
Training
URI https://ieeexplore.ieee.org/document/8954429