Mixed Effects Neural Networks (MeNets) With Applications to Gaze Estimation
There is much interest in computer vision to utilize commodity hardware for gaze estimation. A number of papers have shown that algorithms based on deep convolutional architectures are approaching accuracies where streaming data from mass-market devices can offer good gaze tracking performance, alth...
Saved in:
Published in | Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 7735 - 7744 |
---|---|
Main Authors | , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.06.2019
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | There is much interest in computer vision to utilize commodity hardware for gaze estimation. A number of papers have shown that algorithms based on deep convolutional architectures are approaching accuracies where streaming data from mass-market devices can offer good gaze tracking performance, although a gap still remains between what is possible and the performance users will expect in real deployments. We observe that one obvious avenue for improvement relates to a gap between some basic technical assumptions behind most existing approaches and the statistical properties of the data used for training. Specifically, most training datasets involve tens of users with a few hundreds (or more) repeated acquisitions per user. The non i.i.d. nature of this data suggests better estimation may be possible if the model explicitly made use of such "repeated measurements" from each user as is commonly done in classical statistical analysis using so-called mixed effects models. The goal of this paper is to adapt these "mixed effects" ideas from statistics within a deep neural network architecture for gaze estimation, based on eye images. Such a formulation seeks to specifically utilize information regarding the hierarchical structure of the training data - each node in the hierarchy is a user who provides tens or hundreds of repeated samples. This modification yields an architecture that offers state of the art performance on various publicly available datasets improving results by 10-20%. |
---|---|
AbstractList | There is much interest in computer vision to utilize commodity hardware for gaze estimation. A number of papers have shown that algorithms based on deep convolutional architectures are approaching accuracies where streaming data from mass-market devices can offer good gaze tracking performance, although a gap still remains between what is possible and the performance users will expect in real deployments. We observe that one obvious avenue for improvement relates to a gap between some basic technical assumptions behind most existing approaches and the statistical properties of the data used for training. Specifically, most training datasets involve tens of users with a few hundreds (or more) repeated acquisitions per user. The non i.i.d. nature of this data suggests better estimation may be possible if the model explicitly made use of such "repeated measurements" from each user as is commonly done in classical statistical analysis using so-called mixed effects models. The goal of this paper is to adapt these "mixed effects" ideas from statistics within a deep neural network architecture for gaze estimation, based on eye images. Such a formulation seeks to specifically utilize information regarding the hierarchical structure of the training data - each node in the hierarchy is a user who provides tens or hundreds of repeated samples. This modification yields an architecture that offers state of the art performance on various publicly available datasets improving results by 10-20%. |
Author | Singh, Vikas Kim, Hyunwoo J. Xiong, Yunyang |
Author_xml | – sequence: 1 givenname: Yunyang surname: Xiong fullname: Xiong, Yunyang organization: Univ. of Wisconsin-Madison – sequence: 2 givenname: Hyunwoo J. surname: Kim fullname: Kim, Hyunwoo J. organization: Korea Univ – sequence: 3 givenname: Vikas surname: Singh fullname: Singh, Vikas organization: Univ. of Wisconsin-Madison |
BookMark | eNotjztPwzAUhQ0CiVIyM7B4hCHlXjsP37Gq0oJoASEeY5XHtTCEpoqNePx6ImD6js7wHZ1DsbfpNizEMcIEEeh89nh7N1GANAHISe-IiHKDuTKoFWmzK0YImY4zQjoQkfcvAKAVYkZmJK5W7pMbWVjLdfDymt_7sh0QPrr-1cvTFQ_Zn8knF57ldLttXV0G1228DJ1clN8sCx_c2293JPZt2XqO_jkWD_PifnYRL28Wl7PpMnYKdIhzorQiZTIihDSrDBkNtqyTBmpKNFurdMlVjoptoyCtbYLKJGlTmwqw0nosTv68jpnX236Y77_WhtIkGQ7_ADFzTng |
ContentType | Conference Proceeding |
DBID | 6IE 6IH CBEJK RIE RIO |
DOI | 10.1109/CVPR.2019.00793 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Applied Sciences |
EISBN | 9781728132938 1728132932 |
EISSN | 1063-6919 |
EndPage | 7744 |
ExternalDocumentID | 8954429 |
Genre | orig-research |
GroupedDBID | 6IE 6IH 6IL 6IN AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP OCL RIE RIL RIO |
ID | FETCH-LOGICAL-i203t-7995b9286991056b89830fac4d0c943eff23aeb712efd205cf412845dc8b01b33 |
IEDL.DBID | RIE |
IngestDate | Wed Aug 27 07:44:55 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i203t-7995b9286991056b89830fac4d0c943eff23aeb712efd205cf412845dc8b01b33 |
PageCount | 10 |
ParticipantIDs | ieee_primary_8954429 |
PublicationCentury | 2000 |
PublicationDate | 2019-June |
PublicationDateYYYYMMDD | 2019-06-01 |
PublicationDate_xml | – month: 06 year: 2019 text: 2019-June |
PublicationDecade | 2010 |
PublicationTitle | Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) |
PublicationTitleAbbrev | CVPR |
PublicationYear | 2019 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0003211698 |
Score | 2.4353275 |
Snippet | There is much interest in computer vision to utilize commodity hardware for gaze estimation. A number of papers have shown that algorithms based on deep... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 7735 |
SubjectTerms | Adaptation models Analytical models and Body Pose Artificial neural networks Computational modeling Computer architecture Computer vision Data models Estimation Face Gesture Motion and Tracking Performance evaluation Training |
Title | Mixed Effects Neural Networks (MeNets) With Applications to Gaze Estimation |
URI | https://ieeexplore.ieee.org/document/8954429 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA7bTp6mbuJvcvCgYLcsSZv0KGNjKJUhTncbTfKCw7GJ7UD8603aOod48JSQHBIS8t5L3vd9QehCEGIgFCaIKIeAp8YdKaVsoIkUlgjKQHg2cnIfjSb8dhpOa-h6w4UBgAJ8Bh1fLXL5ZqXX_qmsK-OQO_tZR3V3cSu5Wpv3FOZuMlEsK_WeHom7_afxg8dueUFK4RPLW9-nFN5j2ETJ97glaOS1s85VR3_-kmT878R2UfuHp4fHGw-0h2qw3EfNKrDE1bHNWugumX-4hlKpOMNekCNduKJAgGf4MgFXz67w8zx_wTdbKW2cr7CHA-GBswQlybGNJsPBY38UVL8oBHNKWB54xTcVUxm5SNBFO0rGkhGbam6IjjkDaylLQYkeBWsoCbXl3meFRktFeoqxA9RYrpZwiDBYZx45B5MSzcH1A019fCkYN5GM-BFq-bWZvZVCGbNqWY7_bj5BO353StzVKWrk72s4cx4-V-fF1n4BeeWmJA |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT8IwGG0QD3pCBeNve_CgiYOu7dbtaAgGhRFiQLmRtf0WiQaMG4nxr7fdJhLjwdOa9rClTb_3rd97rwhdCEI0eEI7PuXg8FibLSVl4igSiIQIykBYNXI08Ltjfj_xJhV0vdLCAEBOPoOmbea1fL1QS3tU1gpCj5v4uYE2De57bqHWWp2oMPMv44dB6d_jkrDVfhw-WPaWtaQUtrS8doFKjh-3NRR9v7mgjbw0l5lsqs9fpoz__bQd1PhR6uHhCoN2UQXme6hWppa43LhpHfWi2YfpKLyKU2wtOeJX88g54Cm-jMC00yv8NMue8c1aURtnC2wJQbhjYkEhc2yg8W1n1O465T0KzowSljnW802GNPBNLmjyHRmEASNJrLgmKuQMkoSyGKRwKSSaEk8l3KKWp1UgiSsZ20fV-WIOBwhDYgIk56BjojiYcaCxzTAF49oPfH6I6nZupm-FVca0nJajv7vP0VZ3FPWn_btB7xht25UqWFgnqJq9L-HU4H0mz_Jl_gJV_alt |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+%28IEEE+Computer+Society+Conference+on+Computer+Vision+and+Pattern+Recognition.+Online%29&rft.atitle=Mixed+Effects+Neural+Networks+%28MeNets%29+With+Applications+to+Gaze+Estimation&rft.au=Xiong%2C+Yunyang&rft.au=Kim%2C+Hyunwoo+J.&rft.au=Singh%2C+Vikas&rft.date=2019-06-01&rft.pub=IEEE&rft.eissn=1063-6919&rft.spage=7735&rft.epage=7744&rft_id=info:doi/10.1109%2FCVPR.2019.00793&rft.externalDocID=8954429 |