Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 47 tok/s
Gemini 2.5 Pro 41 tok/s Pro
GPT-5 Medium 28 tok/s Pro
GPT-5 High 25 tok/s Pro
GPT-4o 104 tok/s Pro
Kimi K2 156 tok/s Pro
GPT OSS 120B 474 tok/s Pro
Claude Sonnet 4 36 tok/s Pro
2000 character limit reached

Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance (2312.07530v3)

Published 12 Dec 2023 in cs.CV

Abstract: Weakly supervised 3D object detection aims to learn a 3D detector with lower annotation cost, e.g., 2D labels. Unlike prior work which still relies on few accurate 3D annotations, we propose a framework to study how to leverage constraints between 2D and 3D domains without requiring any 3D labels. Specifically, we employ visual data from three perspectives to establish connections between 2D and 3D domains. First, we design a feature-level constraint to align LiDAR and image features based on object-aware regions. Second, the output-level constraint is developed to enforce the overlap between 2D and projected 3D box estimations. Finally, the training-level constraint is utilized by producing accurate and consistent 3D pseudo-labels that align with the visual data. We conduct extensive experiments on the KITTI dataset to validate the effectiveness of the proposed three constraints. Without using any 3D labels, our method achieves favorable performance against state-of-the-art approaches and is competitive with the method that uses 500-frame 3D annotations. Code will be made publicly available at https://github.com/kuanchihhuang/VG-W3D.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (27)
  1. End-to-end object detection with transformers. In European Conference on Computer Vision (ECCV), 2020.
  2. Emerging properties in self-supervised vision transformers. In IEEE International Conference on Computer Vision (ICCV), 2021.
  3. 3d object proposals for accurate object class detection. In Advances in Neural Information Processing Systems (NeurIPS), 2015.
  4. Voxelnext: Fully sparse voxelnet for 3d object detection and tracking. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
  5. Voxel r-cnn: Towards high performance voxel-based 3d object detection. AAAI Conference on Artificial Intelligence (AAAI), 2021.
  6. Are we ready for autonomous driving? the kitti vision benchmark suite. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012.
  7. Structure aware single-stage 3d object detection from point cloud. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
  8. Pg-rcnn: Semantic surface point generation for 3d object detection. In IEEE International Conference on Computer Vision (ICCV), 2023.
  9. Pointpillars: Fast encoders for object detection from point clouds. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
  10. Unifying voxel-based representation with transformer for 3d object detection. In Advances in Neural Information Processing Systems (NeurIPS), 2022.
  11. Microsoft coco: Common objects in context. arXiv preprint arXiv:1405.0312, 2014.
  12. Multimodal transformer for automatic 3d annotation and object detection. In European Conference on Computer Vision (ECCV), 2022.
  13. Map-gen: An automated 3d-box annotation flow with multimodal attention point generator. In ICPR, 2022.
  14. M3dssd: Monocular 3d single stage object detector. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
  15. Voxel transformer for 3d object detection. IEEE International Conference on Computer Vision (ICCV), 2021.
  16. Towards a weakly supervised framework for 3d point cloud object detection and annotation. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020.
  17. Weakly supervised 3d object detection from lidar point cloud. In European Conference on Computer Vision (ECCV), 2020.
  18. 3d object detection with pointformer. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
  19. Generalized intersection over union: A metric and a loss for bounding box regression. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
  20. Pointrcnn: 3d object proposal generation and detection from point cloud. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
  21. Point-gnn: Graph neural network for 3d object detection in a point cloud. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
  22. Disentangling monocular 3d object detection. In IEEE International Conference on Computer Vision (ICCV), 2019.
  23. Cut and learn for unsupervised object detection and instance segmentation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
  24. FGR: Frustum-Aware Geometric Reasoning for Weakly Supervised 3D Vehicle Detection. In IEEE International Conference on Robotics and Automation (ICRA), 2021.
  25. Cia-ssd: Confident iou-aware single-stage object detector from point cloud. In AAAI Conference on Artificial Intelligence (AAAI), 2021.
  26. Objects as points. In arXiv preprint arXiv:1904.07850, 2019.
  27. Voxelnet: End-to-end learning for point cloud based 3d object detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
Citations (2)
List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-Up Questions

We haven't generated follow-up questions for this paper yet.

Github Logo Streamline Icon: https://streamlinehq.com