improving rgb d face recognition via transferring
play

Improving RGB-D face recognition via transferring pretrained 2D - PowerPoint PPT Presentation

Improving RGB-D face recognition via transferring pretrained 2D networks Xingwang Xiong, Xu Wen, and Cheng Huang INSTITUTE O https://github.com/XingwXiong/Face3D-Pytorch OF C COMPUTING T TECHNOLOGY 3D Face Recognition Algorithm Challenge


  1. Improving RGB-D face recognition via transferring pretrained 2D networks Xingwang Xiong, Xu Wen, and Cheng Huang INSTITUTE O https://github.com/XingwXiong/Face3D-Pytorch OF C COMPUTING T TECHNOLOGY 3D Face Recognition Algorithm Challenge (3DFRAC) ICT, Chinese Academy of Sciences

  2. Face Representations 3D Face Images RGB image Point cloud 3D Mesh Depth image 3DFRAC Bench 19

  3. 3D Face Recognition Algorithm Challenge n RGB-D Face Recognition n The value in the depth image reflects the distance of scene object surface from the viewpoint . RGB-D image Depth camera (Kinect V2) RGB image Depth image 3DFRAC Bench 19

  4. Why do we need RGB-D FR ? n 2D FR is sensitive to external variations n Poses n Facial expressions n Illuminations n Extra low-level patterns on depth images n Smooth variations n Contracts n Borders & global layouts n Face Anti-spoofing (ICPR 2018) 3DFRAC Bench 19

  5. Open-set vs. Closed-set FR n Open-set FR n Classification n Closed-set FR n Face embedding n Similarity comparison • SVM • KNN n 3DFRAC n a closed-set problem 3DFRAC Bench 19 CVPR 2017

  6. RGB Images vs. Depth Images RGB images Depth images n High frequency patterns n Low frequency patterns n Textures & Details n Smooth variations n Contracts n Borders & Global layouts n Easy to obtain n Massive scale n Not enough to learn a deep n ∼ 3.3 million faces 1 CNN n ∼ 9K identities 1 n ∼ 403K million faces 2 n ∼ 1.2K identities 2 1. VGGFace2 2. Intellifusion RGB-D face dataset 3DFRAC Bench 19

  7. Goal n To leverage both conventional RGB-based works and depth features 3DFRAC Bench 19

  8. Inter-modal Transfer Learning 2D pretrained network RGB-D network Copy weights 3DFRAC Bench 19

  9. Inter-modal Transfer Learning n Use ResNet-50 as the backbone network n Copy pretrained weights of middle layers n Fine-tune the whole model with 224x224 RGB-D images 3DFRAC Bench 19

  10. Preprocessing n Face Detection & Alignment n MTCNN (SPL 2016) n Randomly horizontally flipping n Normalizing n 0-1 range Face Detection MTCNN Locate the face 3DFRAC Bench 19

  11. Results n 94.64% accuracy on the Intellifusion RGB-D dataset n Won 1 st on 3DFRAC Method Input data CNN Models Accuracy (%) Pretrained on ImageNet RGB images ResNet-50 94.47 Train from scratch RGB-D images RGB-D ResNet50 88.36 Pretrained on ImageNet RGB-D images RGB-D ResNet50 94.64 3DFRAC Bench 19

  12. Conclusions n Inter-modal transfer learning from pretrained 2D networks to RGB-D networks improves recognition accuracy n Code is open-sourced n https://github.com/XingwXiong/Face3D-Pytorch n Email n xiongxingwang@ict.ac.cn 3DFRAC Bench 19

  13. 3DFRAC Bench 19

Recommend


More recommend