3D Face Recognition Based on Key Feature Enhancement Mechanism

WANG Qi; QIAN Weizhong; LEI Hang; WANG Xupeng

doi:10.12178/1001-0548.2023012

Volume 53 Issue 2

Apr. 2024


WANG Qi, QIAN Weizhong, LEI Hang, WANG Xupeng. 3D Face Recognition Based on Key Feature Enhancement Mechanism[J]. Journal of University of Electronic Science and Technology of China, 2024, 53(2): 252-258. doi: 10.12178/1001-0548.2023012


WANG Qi^{1},
QIAN Weizhong^{1, 2},
LEI Hang^{1},
WANG Xupeng^{1}

1. School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China
2. Kash Institute of Electronics and Information Industry, Kash 844000, China

Received Date: 2023-01-06
Revised Date: 2023-07-02

Available Online: 2024-04-01

Publish Date: 2024-03-30

Abstract

3D face recognition is an important part of computer vision. PointNet uses deep learning to handle the unordered nature of point clouds and to extract global features. However, because point clouds lack detailed texture, global features alone are not sufficient for face recognition in complex situations. To address these problems, a local feature descriptor is proposed to describe the local spatial geometric features of point clouds, and a key feature enhancement mechanism is introduced that strengthens the key features of the face through a probability distribution, which reduces the interference of unnecessary features and effectively improves the accuracy of the model. Experiments were carried out on the public datasets CASIA-3D, Lock3DFace, and Bosphorus. The results show that the method copes well with expression changes, partial occlusion, and head pose interference. In weak light conditions in particular, accuracy improves by 1.1 percentage points over RP-Net, and the method also has good real-time performance.
Keywords: 3D face recognition; deep learning; local feature descriptor; feature enhancement; point cloud

References

[1]	BORGHI G, FABBRI M, VEZZANI R, et al. Face-from-depth for head pose estimation on depth images[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2018, 42(3): 596-609.
[2]	QI C R, SU H, MO K, et al. Pointnet: Deep learning on point sets for 3D classification and segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 652-660.
[3]	QI C R, YI L, SU H, et al. Pointnet++: Deep hierarchical feature learning on point sets in a metric space[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach: [s.n.], 2017: 5105-5114.
[4]	DENG H, BIRDAL T, ILIC S. PPFNet: Global context aware local features for robust 3D point matching[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 195-205.
[5]	LI Y, BU R, SUN M, et al. PointCNN: Convolution on x-transformed[EB/OL]. [2023-01-04]. https://proceedings.neurips.cc/paper/2018/file/f5f8590cd58a54e94377e6ae2eded4d9-Paper.pdf.
[6]	XIE Z, CHEN J, PENG B. Point clouds learning with attention-based graph convolution networks[J]. Neurocomputing, 2020, 402: 245-255. doi: 10.1016/j.neucom.2020.03.086
[7]	ZHAO H, JIANG L, JIA J, et al. Point transformer[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Beijing: IEEE, 2021: 16259-16268.
[8]	XIAO S H, SANG N, WANG X P. 3D point cloud head pose estimation based on deep learning[J]. Journal of Computer Applications, 2020, 40(4): 996-1001. (in Chinese)
[9]	XIAO S H, SANG N, WANG X P, et al. Leveraging ordinal regression with soft labels for 3D head pose estimation from point sets[C]//ICASSP 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona: IEEE, 2020: 1883-1887.
[10]	WANG Q, LEI H, WANG X. Deep face verification under posture interference[J]. Journal of Computer Applications, 2022, 43: 595-600.
[11]	CAO Y, LIU S, ZHAO P, et al. RP-Net: PointNet++ 3D face recognition algorithm integrating RoPS local descriptor[J]. IEEE Access, 2022, 10: 91245-91252. doi: 10.1109/ACCESS.2022.3202216
[12]	XU K, WANG X, HU Z, et al. 3D face recognition based on twin neural network combining deepmap and texture[C]//IEEE 19th International Conference on Communication Technology (ICCT). Xi’an: IEEE, 2019: 1665-1668.
[13]	DUTTA K, BHATTACHARJEE D, NASIPURI M. SPPCANet: A simple deep learning-based feature extraction approach for 3D face recognition[J]. Multimedia Tools and Applications, 2020, 79(41): 31329-31352.
[14]	GERA D, BALASUBRAMANIAN S. Landmark guidance independent spatio-channel attention and complementary context information based facial expression recognition[J]. Pattern Recognition Letters, 2021, 145: 58-66. doi: 10.1016/j.patrec.2021.01.029
[15]	SCHROFF F, KALENICHENKO D, PHILBIN J. Facenet: A unified embedding for face recognition and clustering[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Boston: IEEE, 2015: 815-823.
[16]	BIOMETRICS C. Note on CASIA-3D FaceV1[EB/OL]. [2023-01-04]. http://biometrics.idealtest.org.
[17]	ZHANG J, HUANG D, WANG Y, et al. Lock3DFace: A large-scale database of low-cost kinect 3D faces[C]//2016 International Conference on Biometrics (ICB). Halmstad: IEEE, 2016: 1-8.
[18]	SAVRAN A, ALYÜZ N, DIBEKLIOĞLU H, et al. Bosphorus database for 3D face analysis[C]//Biometrics and Identity Management: First European Workshop, BIOID 2008. Berlin: Springer, 2008: 47-56.
[19]	CHANDRAKALA M, RAVI S. Effective 3D face recognition technique based on Gabor and LTP features[J]. International Journal of Engineering and Advanced Technology (IJEAT), 2018, 8(2): 284-290.
[20]	HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas: IEEE, 2016: 770-778.
[21]	SZEGEDY C, VANHOUCKE V, IOFFE S J, et al. Rethinking the inception architecture for computer vision[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas: IEEE, 2016: 2818-2826.
[22]	SANDLER M, HOWARD A, ZHU M, et al. Mobilenetv2: Inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 4510-4520.
[23]	MU G, HUANG D, HU G, et al. Led3D: A lightweight and efficient deep approach to recognizing low-quality 3D faces[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Los Angeles: IEEE, 2019: 5766-5775.
[24]	SOLTANPOUR S, WU Q J. High-order local normal derivative pattern (LNDP) for 3D face recognition[C]//IEEE International Conference on Image Processing (ICIP). Beijing: IEEE, 2017: 2811-2815.
[25]	DUTTA K, BHATTACHARJEE D, NASIPURI M, et al. 3D face recognition based on volumetric representation of range image[J]. Advanced Computing and Systems for Security, 2019, 8: 175-189.


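Face recognition is an important part of computer vision and is widely used in daily life. However, the imaging quality of ordinary RGB images depends on illumination, so in poorly lit scenes such as night driving, surveillance, and tracking it is difficult to obtain useful information with an ordinary camera^[1]. With the development of 3D cameras such as Kinect, acquiring high-quality 3D images has become more convenient. Because their imaging is infrared-based, they remain stable under illumination changes, and each pixel represents the distance from the sensor to the target, which effectively captures the target's spatial information. A simple coordinate transformation converts a 3D image into 3D point cloud data. Point-cloud-based face recognition has two advantages. First, the data itself is stable under illumination changes, so it can be applied effectively in scenes with little light or large illumination variation. Second, the background can be separated simply from the distance information, which reduces background interference and lets the task focus on the subject itself^[1].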
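The coordinate transformation and distance-based background removal described above can be sketched as follows. This is a minimal illustration that assumes a pinhole camera model; the function name `depth_to_point_cloud`, the intrinsics `fx`, `fy`, `cx`, `cy`, and the threshold `max_depth` are illustrative assumptions and do not come from the paper.

```python
def depth_to_point_cloud(depth, fx, fy, cx, cy, max_depth):
    """Back-project a depth image (list of rows, metres) to 3D points.

    Assumes a pinhole camera with focal lengths (fx, fy) and principal
    point (cx, cy). Pixels with no reading (depth <= 0) or farther than
    max_depth (treated as background) are dropped.
    """
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0.0 or z > max_depth:
                continue  # missing measurement or background
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


# Toy 2x3 depth map: one missing pixel (0.0) and one background pixel (2.5 m).
depth = [
    [0.0, 0.8, 2.5],
    [0.8, 0.7, 0.8],
]
cloud = depth_to_point_cloud(depth, fx=1.0, fy=1.0, cx=1.0, cy=0.5, max_depth=1.0)
```

Only the four pixels closer than `max_depth` survive, which is the sense in which the distance information alone separates the face from the background.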

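Ref. [2] proposed PointNet, which handles the unordered nature of point clouds so that they can be used in deep learning; it extracts global point cloud features with a deep neural network and applies them to point cloud classification and segmentation. Ref. [3] proposed PointNet++, which groups points by their spatial positions to extract features hierarchically. Ref. [4] proposed PPFNet by fusing local and global features, which enriches the representation of point cloud features. Ref. [5] used the X-conv operator to enable convolutional neural networks on point clouds. Ref. [6] built a point cloud feature extraction network with fast convergence and low computational complexity based on graph convolutional networks. In recent years, more network models have been applied to 3D point clouds; for example, Ref. [7] designed self-attention layers for point clouds and used them to build deep networks for tasks such as semantic scene segmentation and classification.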
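The way PointNet copes with unordered input can be illustrated with a small sketch: the same perceptron is applied to every point, and a channel-wise maximum (a symmetric function) aggregates the results, so the output does not depend on point order. The one-layer perceptron, its random weights, and the names `shared_mlp` and `global_feature` are simplifying assumptions for illustration, not the architecture of Ref. [2].

```python
import random


def shared_mlp(point, weights, bias):
    """Apply the same one-layer perceptron with ReLU to a single point."""
    return [
        max(0.0, sum(w * c for w, c in zip(row, point)) + b)
        for row, b in zip(weights, bias)
    ]


def global_feature(points, weights, bias):
    """Channel-wise max over per-point features (order-independent)."""
    per_point = [shared_mlp(p, weights, bias) for p in points]
    return [max(channel) for channel in zip(*per_point)]


rng = random.Random(0)
weights = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(4)]
bias = [rng.uniform(-1, 1) for _ in range(4)]
points = [(rng.random(), rng.random(), rng.random()) for _ in range(16)]

# Shuffling the points leaves the global feature unchanged.
shuffled = points[:]
rng.shuffle(shuffled)
same = global_feature(points, weights, bias) == global_feature(shuffled, weights, bias)
```

Here `same` evaluates to `True`, because the maximum over a set of per-point features is unaffected by the order in which the points are visited.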

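With the development of point-cloud-based deep learning models, 3D point clouds have increasingly been applied to face analysis tasks. Refs. [8-9] used PointNet++ to extract global 3D face features and achieved efficient face pose estimation. Ref. [10] built a new Siamese network for face verification under pose interference. Refs. [11-12] complemented 3D point cloud information with 2D features for face recognition. Ref. [13] proposed a sparse principal component analysis network for 3D face feature extraction. Ref. [14] introduced an attention mechanism combined with contextual information for facial expression analysis. Much point-cloud-based face analysis work has achieved breakthroughs. However, because point cloud data lacks detailed texture, how to extract features effectively or strengthen key information under complex conditions such as facial expression changes, pose interference, and partial occlusion remains a research focus in this field.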

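To address these problems, this paper takes 3D point cloud data as input and builds a new deep learning model for 3D face recognition. The point cloud is first divided into groups, and a local feature descriptor captures the spatial geometric information of each group. PointNet then extracts local features, and a max pooling layer aggregates the local features into a global feature.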
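The grouping step can be sketched as below. The excerpt does not define the grouping rule or the local feature descriptor, so this sketch substitutes common stand-ins: farthest point sampling to choose group centres, a fixed-radius neighbourhood, and a descriptor made of each neighbour's offset from the centre plus its distance. All function names and the `radius` value are assumptions for illustration only.

```python
import math


def farthest_point_sampling(points, n_centroids):
    """Pick n_centroids well-spread point indices, starting from index 0."""
    chosen = [0]
    dist = [math.dist(p, points[0]) for p in points]
    while len(chosen) < n_centroids:
        idx = max(range(len(points)), key=dist.__getitem__)
        chosen.append(idx)
        # Keep, for every point, its distance to the nearest chosen centre.
        dist = [min(d, math.dist(p, points[idx])) for d, p in zip(dist, points)]
    return chosen


def group_with_descriptor(points, centroid_ids, radius):
    """For each centre, collect neighbours within radius and describe each
    neighbour by (dx, dy, dz, distance) relative to the centre."""
    groups = []
    for ci in centroid_ids:
        centre = points[ci]
        group = []
        for p in points:
            d = math.dist(p, centre)
            if d <= radius:
                offset = tuple(a - b for a, b in zip(p, centre))
                group.append(offset + (d,))
        groups.append(group)
    return groups


points = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 0.1, 0.0)]
centroids = farthest_point_sampling(points, 2)
groups = group_with_descriptor(points, centroids, radius=0.4)
```

In a full model, each group's descriptors would be passed through a shared network and max pooled, first within the group and then across groups, to obtain the global feature.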

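To increase the weight of key features, a key feature enhancement mechanism maps the global feature to a probability distribution in feature space and strengthens feature discriminability through an enhancement coefficient, thereby reducing interference from facial expressions, head pose, and similar factors. Because point cloud data itself is stable under illumination changes, the method is also suitable for scenes with little light or large illumination variation.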
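One possible reading of this mechanism is sketched below. The excerpt does not give the exact formula, so the sketch assumes that a softmax turns the global feature into a probability distribution and that each channel is then scaled by `1 + lam * p`, where `lam` plays the role of the enhancement coefficient. Both the softmax and the scaling rule are assumptions, not the authors' definition.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def enhance(features, lam):
    """Scale each channel by 1 + lam * p, with p its softmax probability.

    With lam = 0 the features are returned unchanged; larger lam gives
    high-probability (key) channels proportionally more weight.
    """
    probs = softmax(features)
    return [f * (1.0 + lam * p) for f, p in zip(features, probs)]


features = [0.2, 1.5, 0.1, 3.0]
enhanced = enhance(features, lam=10.0)
ratios = [e / f for e, f in zip(enhanced, features)]
```

Under this assumed form, the channel with the largest activation receives the largest relative boost, and setting `lam` to zero disables the enhancement.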

4. Conclusion

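To address face recognition under illumination changes, facial expressions, head pose variation, and partial occlusion, this paper takes 3D point clouds directly as input and introduces a local feature descriptor and a key feature enhancement mechanism to improve feature discriminability, which effectively improves recognition accuracy while maintaining good real-time performance. The method is stable under illumination changes and can be applied in scenes with little light or large illumination variation, such as driver authentication at night and night-time detection.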

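Future work will study new algorithms to further improve the discriminability of 3D face features and apply them to other face analysis tasks under complex conditions, such as head detection, pose estimation, and face verification.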


Receptive field k	Accuracy/%
0.2	86.6
0.4	89.2
0.6	82.7
0.8	77.9
1	73.6

Enhancement parameter $ \lambda $	Accuracy/%	Enhancement parameter $ \lambda $	Accuracy/%
0	89.2	55	98.9
5	89.2	60	98.8
10	89.1	65	98.7
15	89.5	70	98.3
20	91.7	75	98.1
25	92.9	80	98.1
30	94.8	85	97.7
35	96.3	90	97.1
40	97.6	95	96.5
45	98.0	100	95.6

Method	Normal light	Weak light
Ref. [19]	98.4	96.04
Ref. [13]	97.9	95.4
Ref. [11]	97.9	97.3
Proposed method	98.9	98.4

Method	Accuracy under interference sequences				Total
	FE	NU	OC	PS	
Ref. [20]	96.1	99.3	54.9	61.4	76.6
Ref. [21]	93.6	99.0	57.0	54.1	74.4
Ref. [22]	95.7	98.9	61.4	69.9	79.5
Ref. [23]	98.1	99.6	78.1	70.4	84.2
Proposed method	98.5	99.2	78.7	70.0	83.6

Method	Normal light	Weak light
Ref. [24]	97.3	95.2
Ref. [25]	95.2	95.1
Ref. [13]	98.5	96.5
Ref. [11]	98.0	97.2
Proposed method	98.4	98.3
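As a quick check of the weak-light gain over RP-Net (Ref. [11]) stated in the abstract, the difference can be computed from the two normal/weak-light tables. The numbers below are copied from those tables; the dictionary keys are just labels chosen for this sketch, since the excerpt does not say which dataset each table belongs to.

```python
# Weak-light accuracy (%) of Ref. [11] and of the proposed method,
# taken from the two normal/weak-light comparison tables.
weak_light = {
    "first table": {"ref11": 97.3, "proposed": 98.4},
    "second table": {"ref11": 97.2, "proposed": 98.3},
}

# Gain in percentage points, rounded to one decimal to avoid float noise.
gains = {
    name: round(row["proposed"] - row["ref11"], 1)
    for name, row in weak_light.items()
}
```

Both entries of `gains` come out at 1.1, which matches the improvement quoted in the abstract.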
