Mixed Effects Neural Networks (MeNets) With Applications to Gaze Estimation

There is much interest in computer vision to utilize commodity hardware for gaze estimation. A number of papers have shown that algorithms based on deep convolutional architectures are approaching accuracies where streaming data from mass-market devices can offer good gaze tracking performance, alth...

Full description

Saved in:

Bibliographic Details
Published in	Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 7735 - 7744
Main Authors	Xiong, Yunyang, Kim, Hyunwoo J., Singh, Vikas
Format	Conference Proceeding
Language	English
Published	IEEE 01.06.2019
Subjects	Adaptation models Analytical models and Body Pose Artificial neural networks Computational modeling Computer architecture Computer vision Data models Estimation Face Gesture Motion and Tracking Performance evaluation Training
Online Access	Get full text

Cover

Loading…

Abstract	There is much interest in computer vision to utilize commodity hardware for gaze estimation. A number of papers have shown that algorithms based on deep convolutional architectures are approaching accuracies where streaming data from mass-market devices can offer good gaze tracking performance, although a gap still remains between what is possible and the performance users will expect in real deployments. We observe that one obvious avenue for improvement relates to a gap between some basic technical assumptions behind most existing approaches and the statistical properties of the data used for training. Specifically, most training datasets involve tens of users with a few hundreds (or more) repeated acquisitions per user. The non i.i.d. nature of this data suggests better estimation may be possible if the model explicitly made use of such "repeated measurements" from each user as is commonly done in classical statistical analysis using so-called mixed effects models. The goal of this paper is to adapt these "mixed effects" ideas from statistics within a deep neural network architecture for gaze estimation, based on eye images. Such a formulation seeks to specifically utilize information regarding the hierarchical structure of the training data - each node in the hierarchy is a user who provides tens or hundreds of repeated samples. This modification yields an architecture that offers state of the art performance on various publicly available datasets improving results by 10-20%.
AbstractList	There is much interest in computer vision to utilize commodity hardware for gaze estimation. A number of papers have shown that algorithms based on deep convolutional architectures are approaching accuracies where streaming data from mass-market devices can offer good gaze tracking performance, although a gap still remains between what is possible and the performance users will expect in real deployments. We observe that one obvious avenue for improvement relates to a gap between some basic technical assumptions behind most existing approaches and the statistical properties of the data used for training. Specifically, most training datasets involve tens of users with a few hundreds (or more) repeated acquisitions per user. The non i.i.d. nature of this data suggests better estimation may be possible if the model explicitly made use of such "repeated measurements" from each user as is commonly done in classical statistical analysis using so-called mixed effects models. The goal of this paper is to adapt these "mixed effects" ideas from statistics within a deep neural network architecture for gaze estimation, based on eye images. Such a formulation seeks to specifically utilize information regarding the hierarchical structure of the training data - each node in the hierarchy is a user who provides tens or hundreds of repeated samples. This modification yields an architecture that offers state of the art performance on various publicly available datasets improving results by 10-20%.
Author	Singh, Vikas Kim, Hyunwoo J. Xiong, Yunyang
Author_xml	– sequence: 1 givenname: Yunyang surname: Xiong fullname: Xiong, Yunyang organization: Univ. of Wisconsin-Madison – sequence: 2 givenname: Hyunwoo J. surname: Kim fullname: Kim, Hyunwoo J. organization: Korea Univ – sequence: 3 givenname: Vikas surname: Singh fullname: Singh, Vikas organization: Univ. of Wisconsin-Madison
BookMark	eNotjztPwzAUhQ0CiVIyM7B4hCHlXjsP37Gq0oJoASEeY5XHtTCEpoqNePx6ImD6js7wHZ1DsbfpNizEMcIEEeh89nh7N1GANAHISe-IiHKDuTKoFWmzK0YImY4zQjoQkfcvAKAVYkZmJK5W7pMbWVjLdfDymt_7sh0QPrr-1cvTFQ_Zn8knF57ldLttXV0G1228DJ1clN8sCx_c2293JPZt2XqO_jkWD_PifnYRL28Wl7PpMnYKdIhzorQiZTIihDSrDBkNtqyTBmpKNFurdMlVjoptoyCtbYLKJGlTmwqw0nosTv68jpnX236Y77_WhtIkGQ7_ADFzTng
ContentType	Conference Proceeding
DBID	6IE 6IH CBEJK RIE RIO
DOI	10.1109/CVPR.2019.00793
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Applied Sciences
EISBN	9781728132938 1728132932
EISSN	1063-6919
EndPage	7744
ExternalDocumentID	8954429
Genre	orig-research
GroupedDBID	6IE 6IH 6IL 6IN AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP OCL RIE RIL RIO
ID	FETCH-LOGICAL-i203t-7995b9286991056b89830fac4d0c943eff23aeb712efd205cf412845dc8b01b33
IEDL.DBID	RIE
IngestDate	Wed Aug 27 07:44:55 EDT 2025
IsPeerReviewed	false
IsScholarly	true
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i203t-7995b9286991056b89830fac4d0c943eff23aeb712efd205cf412845dc8b01b33
PageCount	10
ParticipantIDs	ieee_primary_8954429
PublicationCentury	2000
PublicationDate	2019-June
PublicationDateYYYYMMDD	2019-06-01
PublicationDate_xml	– month: 06 year: 2019 text: 2019-June
PublicationDecade	2010
PublicationTitle	Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online)
PublicationTitleAbbrev	CVPR
PublicationYear	2019
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0003211698
Score	2.4353275
Snippet	There is much interest in computer vision to utilize commodity hardware for gaze estimation. A number of papers have shown that algorithms based on deep...
SourceID	ieee
SourceType	Publisher
StartPage	7735
SubjectTerms	Adaptation models Analytical models and Body Pose Artificial neural networks Computational modeling Computer architecture Computer vision Data models Estimation Face Gesture Motion and Tracking Performance evaluation Training
Title	Mixed Effects Neural Networks (MeNets) With Applications to Gaze Estimation
URI	https://ieeexplore.ieee.org/document/8954429
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA7bTp6mbuJvcvCgYLcsSZv0KGNjKJUhTncbTfKCw7GJ7UD8603aOod48JSQHBIS8t5L3vd9QehCEGIgFCaIKIeAp8YdKaVsoIkUlgjKQHg2cnIfjSb8dhpOa-h6w4UBgAJ8Bh1fLXL5ZqXX_qmsK-OQO_tZR3V3cSu5Wpv3FOZuMlEsK_WeHom7_afxg8dueUFK4RPLW9-nFN5j2ETJ97glaOS1s85VR3_-kmT878R2UfuHp4fHGw-0h2qw3EfNKrDE1bHNWugumX-4hlKpOMNekCNduKJAgGf4MgFXz67w8zx_wTdbKW2cr7CHA-GBswQlybGNJsPBY38UVL8oBHNKWB54xTcVUxm5SNBFO0rGkhGbam6IjjkDaylLQYkeBWsoCbXl3meFRktFeoqxA9RYrpZwiDBYZx45B5MSzcH1A019fCkYN5GM-BFq-bWZvZVCGbNqWY7_bj5BO353StzVKWrk72s4cx4-V-fF1n4BeeWmJA
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT8IwGG0QD3pCBeNve_CgiYOu7dbtaAgGhRFiQLmRtf0WiQaMG4nxr7fdJhLjwdOa9rClTb_3rd97rwhdCEI0eEI7PuXg8FibLSVl4igSiIQIykBYNXI08Ltjfj_xJhV0vdLCAEBOPoOmbea1fL1QS3tU1gpCj5v4uYE2De57bqHWWp2oMPMv44dB6d_jkrDVfhw-WPaWtaQUtrS8doFKjh-3NRR9v7mgjbw0l5lsqs9fpoz__bQd1PhR6uHhCoN2UQXme6hWppa43LhpHfWi2YfpKLyKU2wtOeJX88g54Cm-jMC00yv8NMue8c1aURtnC2wJQbhjYkEhc2yg8W1n1O465T0KzowSljnW802GNPBNLmjyHRmEASNJrLgmKuQMkoSyGKRwKSSaEk8l3KKWp1UgiSsZ20fV-WIOBwhDYgIk56BjojiYcaCxzTAF49oPfH6I6nZupm-FVca0nJajv7vP0VZ3FPWn_btB7xht25UqWFgnqJq9L-HU4H0mz_Jl_gJV_alt
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+%28IEEE+Computer+Society+Conference+on+Computer+Vision+and+Pattern+Recognition.+Online%29&rft.atitle=Mixed+Effects+Neural+Networks+%28MeNets%29+With+Applications+to+Gaze+Estimation&rft.au=Xiong%2C+Yunyang&rft.au=Kim%2C+Hyunwoo+J.&rft.au=Singh%2C+Vikas&rft.date=2019-06-01&rft.pub=IEEE&rft.eissn=1063-6919&rft.spage=7735&rft.epage=7744&rft_id=info:doi/10.1109%2FCVPR.2019.00793&rft.externalDocID=8954429