Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 62 tok/s
Gemini 2.5 Pro 48 tok/s Pro
GPT-5 Medium 14 tok/s Pro
GPT-5 High 13 tok/s Pro
GPT-4o 93 tok/s Pro
Kimi K2 213 tok/s Pro
GPT OSS 120B 458 tok/s Pro
Claude Sonnet 4 38 tok/s Pro
2000 character limit reached

Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis (2403.06529v2)

Published 11 Mar 2024 in cs.CV

Abstract: 2D face recognition encounters challenges in unconstrained environments due to varying illumination, occlusion, and pose. Recent studies focus on RGB-D face recognition to improve robustness by incorporating depth information. However, collecting sufficient paired RGB-D training data is expensive and time-consuming, hindering wide deployment. In this work, we first construct a diverse depth dataset generated by 3D Morphable Models for depth model pre-training. Then, we propose a domain-independent pre-training framework that utilizes readily available pre-trained RGB and depth models to separately perform face recognition without needing additional paired data for retraining. To seamlessly integrate the two distinct networks and harness the complementary benefits of RGB and depth information for improved accuracy, we propose an innovative Adaptive Confidence Weighting (ACW). This mechanism is designed to learn confidence estimates for each modality to achieve modality fusion at the score level. Our method is simple and lightweight, only requiring ACW training beyond the backbone models. Experiments on multiple public RGB-D face recognition benchmarks demonstrate state-of-the-art performance surpassing previous methods based on depth estimation and feature fusion, validating the efficacy of our approach.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. Challenges in determining the depth in 2-d images. In 2022 International Conference on INnovations in Intelligent SysTems and Applications (INISTA), pages 1–6. IEEE, 2022.
  2. A morphable model for the synthesis of 3d faces. In Seminal Graphics Papers: Pushing the Boundaries, Volume 2, pages 157–164. 1999.
  3. Mobilefacenets: Efficient cnns for accurate real-time face verification on mobile devices. In Biometric Recognition: 13th Chinese Conference, CCBR 2018, Urumqi, China, August 11-12, 2018, Proceedings 13, pages 428–438. Springer, 2018.
  4. High-accuracy rgb-d face recognition via segmentation-aware face depth estimation and mask-guided attention network. In 2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021), pages 1–8, Dec. 2021.
  5. Rgb-d face recognition with identity-style disentanglement and depth augmentation. IEEE Transactions on Biometrics, Behavior, and Identity Science, pages 1–1, 2023.
  6. Rgb-d face recognition via learning-based reconstruction. In 2016 IEEE 8th International Conference on Biometrics Theory, Applications and Systems (BTAS), pages 1–7. IEEE, 2016.
  7. Improving 2d face recognition via discriminative face depth estimation. In 2018 International Conference on Biometrics (ICB), pages 140–147, Feb. 2018.
  8. Arcface: Additive angular margin loss for deep face recognition. IEEE TPAMI, 44:5962–5979, 2018.
  9. Learning confidence for out-of-distribution detection in neural networks. arXiv preprint arXiv:1802.04865, 2018.
  10. Morphable face models-an open framework. In 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), pages 75–82. IEEE, 2018.
  11. Syed Zulqarnain Gilani. Learning from millions of 3d scans for large-scale 3d face recognition. In CVPR, pages 1896–1905, June 2018.
  12. On rgb-d face recognition using kinect. In 2013 IEEE Sixth International Conference on Biometrics: Theory, Applications and Systems (BTAS), pages 1–6. IEEE, 2013.
  13. Towards 3d face recognition in the real: a registration-free approach using fine-grained matching of 3d keypoint descriptors. IJCV, 113:128–142, 2015.
  14. Learning a model of facial shape and expression from 4d scans. ACM TOG, 36(6):194–1, 2017.
  15. High quality facial data synthesis and fusion for 3d low-quality face recognition. In 2021 IEEE International Joint Conference on Biometrics (IJCB), pages 1–8. IEEE, 2021.
  16. An efficient multimodal 2d-3d hybrid approach to automatic face recognition. IEEE TPAMI, 29(11):1927–1943, Nov. 2007.
  17. Led3d: A lightweight and efficient deep approach to recognizing low-quality 3d faces. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5773–5782, 2019.
  18. Face reconstruction from skull shapes and physical attributes. In Pattern Recognition: 31st DAGM Symposium, Jena, Germany, September 9-11, 2009. Proceedings 31, pages 232–241. Springer, 2009.
  19. Bosphorus database for 3d face analysis. In Biometrics and Identity Management: First European Workshop, BIOID 2008, Roskilde, Denmark, May 7-9, 2008. Revised Selected Papers 1, pages 47–56. Springer, 2008.
  20. Face recognition: a novel multi-level taxonomy based survey. IET Biometrics, 9(2):58–67, 2020.
  21. Depth as attention for face representation learning. IEEE Transactions on Information Forensics and Security, 16:2461–2476, 2021.
  22. Teacher-student adversarial depth hallucination to improve face recognition. In ICCV, pages 3671–3680, 2021.
  23. Two-level attention-based fusion learning for rgb-d face recognition. In ICPR, pages 10120–10127. IEEE, 2021.
  24. Improving rgb-d face recognition via transfer learning from a pretrained 2d network. In International Symposium on Benchmarking, Measuring and Optimization, pages 141–148. Springer, 2019.
  25. Learning face representation from scratch. arXiv preprint arXiv:1411.7923, 2014.
  26. Rgb-d face recognition via deep complementary and common feature learning. In 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), pages 8–15. IEEE, 2018.
  27. Lock3dface: A large-scale database of low-cost kinect 3d faces. In 2016 International Conference on Biometrics (ICB), pages 1–8. IEEE, 2016.
  28. Lmfnet: A lightweight multiscale fusion network with hierarchical structure for low-quality 3-d face recognition. IEEE Transactions on Human-Machine Systems, 2022.
  29. Exploiting enhanced and robust rgb-d face representation via progressive multi-modal learning. Pattern Recognition Letters, 166:38–45, 2023.

Summary

We haven't generated a summary for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube