DSNet: Dual-stream multi-scale fusion network for low-quality 3D face recognition

3D face recognition (FR) has become increasingly widespread due to the illumination invariance and pose robustness of 3D face data. Most existing 3D FR methods can only achieve excellent performance on complete and smooth faces. However, low-quality 3D FR with missing facial features still suffers f...

Full description

Saved in:

Bibliographic Details
Published in	AIP advances Vol. 13; no. 8; pp. 085310 - 085310-13
Main Authors	Zhao, Panzi, Ming, Yue, Hu, Nannan, Lyu, Boyang, Zhou, Jiangwan
Format	Journal Article
Language	English
Published	Melville American Institute of Physics 01.08.2023 AIP Publishing LLC
Subjects	Datasets Face recognition Feature extraction Modules Redundancy
Online Access	Get full text

Cover

Loading…

More Information
Summary:	3D face recognition (FR) has become increasingly widespread due to the illumination invariance and pose robustness of 3D face data. Most existing 3D FR methods can only achieve excellent performance on complete and smooth faces. However, low-quality 3D FR with missing facial features still suffers from insufficient discriminative feature extraction for visible face regions. In this paper, we propose a dual-stream multi-scale fusion network (DSNet) for low-quality 3D FR. First, in the first stream, we design a new multi-scale local and global feature fusion network, which consists of an enhanced shallow feature extraction module, an enhanced deep feature extraction module, and a layered multi-scale feature correlation fusion module, aiming to obtain more discriminative details and category information of the facial visible region, reducing the interference of similar features and the redundancy of the same features. Second, we also introduced a capsule network as the second stream to enhance the expression of 3D facial spatial position information, thereby further improving the performance of low-quality 3D FR with missing facial features. We conduct extensive experiments on low-quality datasets (Lock3DFace, KinectFaceDB, and IIIT-D) and cross-quality datasets synthesized by Bosphorus. These results show that our proposed DSNet can achieve state-of-the-art recognition performance and exhibit excellent performance on low-quality 3D faces with missing facial features.
ISSN:	2158-3226 2158-3226
DOI:	10.1063/5.0153077