FSKT‐GE: Feature maps similarity knowledge transfer for low‐resolution gaze estimation


Bibliographic Details
Published in IET image processing Vol. 18; no. 6; pp. 1642-1654
Main Authors Yan, Chao; Pan, Weiguo; Dai, Songyin; Xu, Bingxin; Xu, Cheng; Liu, Hongzhe; Li, Xuewei
Format Journal Article
Language English
Published Wiley 01.05.2024
Summary: The limited texture detail in low‐resolution facial or eye images presents a challenge for gaze estimation. To address this, FSKT‐GE (feature maps similarity knowledge transfer for low‐resolution gaze estimation) is proposed: a gaze estimation framework consisting of a high‐resolution (HR) network and a low‐resolution (LR) network with identical structure. Rather than having the LR network merely imitate HR features, the framework measures the cosine similarity of feature layers, emphasizing the similarity of feature distributions between the HR and LR networks. This enables the LR network to acquire richer knowledge. The framework uses a combined loss function that incorporates the cosine similarity measurement, a soft loss based on the probability distribution difference and the gaze direction output, and a hard loss from the LR network output layer. The approach is validated on low‐resolution datasets derived from the Gaze360 and RT‐Gene datasets, demonstrating excellent performance in low‐resolution gaze estimation. Evaluations are conducted on both datasets using low‐resolution images obtained through 2×, 4×, and 8× down‐sampling. On the Gaze360 dataset, the lowest mean angular errors of 10.97°, 11.22°, and 13.61° were achieved, while on the RT‐Gene dataset, the lowest mean angular errors were 6.73°, 6.83°, and 7.75°. The motivation behind this work is to address the challenge of accurately estimating gaze direction for low‐resolution facial images encountered in unconstrained outdoor environments.
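The summary names three loss terms (feature-map cosine similarity, a soft loss, and a hard loss) and reports results as mean angular error. The following is a minimal sketch of how such quantities could be computed, not the authors' implementation. Several details are assumptions that the summary does not specify: the soft and hard terms are written here as mean squared errors, the three terms are combined by a weighted sum, and the function names and the weights `w_feat`, `w_soft`, and `w_hard` are hypothetical. Feature maps are treated as flattened lists of floats, and gaze directions as 3D vectors.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (lists of floats)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # Small epsilon avoids division by zero for all-zero vectors.
    return dot / (norm_a * norm_b + 1e-12)


def mse(a, b):
    """Mean squared error between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def combined_loss(feat_hr, feat_lr, gaze_hr, gaze_lr, gaze_true,
                  w_feat=1.0, w_soft=1.0, w_hard=1.0):
    """Weighted sum of three terms (the weighting scheme is an assumption).

    feat_term: 1 - cosine similarity of flattened HR and LR feature maps.
    soft_term: LR prediction vs. HR (teacher) prediction; MSE is assumed.
    hard_term: LR prediction vs. ground truth; MSE is assumed.
    """
    feat_term = 1.0 - cosine_similarity(feat_hr, feat_lr)
    soft_term = mse(gaze_hr, gaze_lr)
    hard_term = mse(gaze_lr, gaze_true)
    return w_feat * feat_term + w_soft * soft_term + w_hard * hard_term


def angular_error_deg(gaze_pred, gaze_true):
    """Angle in degrees between predicted and true 3D gaze vectors."""
    cos = cosine_similarity(gaze_pred, gaze_true)
    cos = max(-1.0, min(1.0, cos))  # clamp against rounding error
    return math.degrees(math.acos(cos))
```

Because the feature term depends only on the angle between the two flattened feature maps, it is insensitive to their overall scale, which is one plausible reading of the summary's emphasis on distribution similarity over direct feature imitation.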
ISSN: 1751-9659, 1751-9667
DOI: 10.1049/ipr2.13056