FSKT‐GE: Feature maps similarity knowledge transfer for low‐resolution gaze estimation
The limited of texture details information in low‐resolution facial or eye images presents a challenge for gaze estimation. To address this, FSKT‐GE (feature maps similarity knowledge transfer for low‐resolution gaze estimation) is proposed, a gaze estimation framework consisting of both a high reso...
Saved in:
Published in | IET image processing Vol. 18; no. 6; pp. 1642 - 1654 |
---|---|
Main Authors | , , , , , , |
Format | Journal Article |
Language | English |
Published |
Wiley
01.05.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The limited of texture details information in low‐resolution facial or eye images presents a challenge for gaze estimation. To address this, FSKT‐GE (feature maps similarity knowledge transfer for low‐resolution gaze estimation) is proposed, a gaze estimation framework consisting of both a high resolution (HR) network and low resolution (LR) network with the identical structure. Rather than mere feature imitation, this issue is addressed by assessing the cosine similarity of feature layers, emphasizing the distribution similarity between the HR and LR networks. This enables the LR network to acquire richer knowledge. This framework utilizes a combination loss function, incorporating cosine similarity measurement, soft loss based on probability distribution difference and gaze direction output, along with a hard loss from the LR network output layer. This approach on low‐resolution datasets derived from Gaze360 and RT‐Gene datasets is validated, demonstrating excellent performance in low‐resolution gaze estimation. Evaluations on low‐resolution images obtained through 2×, 4×, and 8× down‐sampling are conducted on two datasets. On the Gaze360 dataset, the lowest mean angular errors of 10.97°, 11.22°, and 13.61° were achieved, while on the RT‐Gene dataset, the lowest mean angular errors of 6.73°, 6.83°, and 7.75° were obtained.
Here, a novel approach called feature map similarity‐based knowledge transfer for low‐resolution gaze estimation (FSKT‐GE) is proposed. The motivation behind this work is to address the challenge of accurately estimating gaze direction for low‐resolution facial images encountered in unconstrained outdoor environments. |
---|---|
ISSN: | 1751-9659 1751-9667 |
DOI: | 10.1049/ipr2.13056 |