
Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered by Multiple Disparity Consistency (2401.12019v2)

Published 22 Jan 2024 in cs.CV

Abstract: In stereo-matching knowledge distillation methods for self-supervised monocular depth estimation, the stereo-matching network's knowledge is distilled into a monocular depth network through pseudo-depth maps. In these methods, a learning-based stereo-confidence network is generally used to identify errors in the pseudo-depth maps so that the errors are not transferred. However, learning-based stereo-confidence networks must be trained with ground truth (GT), which is not feasible in a self-supervised setting. In this paper, we propose a method that identifies and filters errors in the pseudo-depth map by checking the consistency of multiple disparity maps, without the need for GT or a training process. Experimental results show that the proposed method outperforms previous methods and works well across various configurations by filtering out erroneous areas where stereo matching is vulnerable, such as textureless regions, occlusion boundaries, and reflective surfaces.
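
To make the idea of consistency-based filtering concrete, here is a minimal sketch in Python with NumPy. It is not the paper's method: the abstract does not say how the multiple disparity maps are produced or which consistency test is applied. The sketch assumes the maps are already available as same-sized arrays, and it uses a hypothetical criterion (the per-pixel spread between the largest and smallest disparity must stay within `threshold` pixels). The function names `consistency_mask` and `filter_pseudo_disparity` are illustrative only. It works on disparity rather than depth for simplicity.

```python
import numpy as np


def consistency_mask(disparities, threshold=1.0):
    """Return a boolean (H, W) mask that is True where all maps agree.

    disparities: sequence of K arrays of shape (H, W), in pixels.
    threshold:   hypothetical tolerance on the per-pixel spread (pixels);
                 this criterion is an assumption, not taken from the paper.
    """
    stack = np.stack(disparities, axis=0)            # shape (K, H, W)
    spread = stack.max(axis=0) - stack.min(axis=0)   # per-pixel range
    return spread <= threshold


def filter_pseudo_disparity(pseudo, mask):
    """Replace inconsistent pixels with NaN so a training loss can skip them."""
    return np.where(mask, pseudo, np.nan)


# Toy 2x2 example: the top row agrees across maps, the bottom row does not.
d1 = np.array([[10.0, 20.0], [30.0, 40.0]])
d2 = np.array([[10.2, 20.1], [35.0, 40.3]])
d3 = np.array([[9.9, 19.8], [30.5, 44.0]])

mask = consistency_mask([d1, d2, d3], threshold=1.0)
filtered = filter_pseudo_disparity(d1, mask)
```

In this toy case the mask keeps the top row and rejects the bottom row, so only the agreeing pixels would supervise the monocular network. No ground truth or trained confidence network is involved, which is the property the abstract emphasizes.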

