BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects (2403.09799v2)
Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2023, the fifth in a series of public competitions organized to capture the state of the art in model-based 6D object pose estimation from an RGB/RGB-D image and related tasks. Besides the three tasks from 2022 (model-based 2D detection, 2D segmentation, and 6D localization of objects seen during training), the 2023 challenge introduced new variants of these tasks focused on objects unseen during training. In the new tasks, methods were required to learn new objects during a short onboarding stage (max 5 minutes, 1 GPU) from provided 3D object models. The best 2023 method for 6D localization of unseen objects (GenFlow) notably reached the accuracy of the best 2020 method for seen objects (CosyPose), although being noticeably slower. The best 2023 method for seen objects (GPose) achieved a moderate accuracy improvement but a significant 43% run-time improvement compared to the best 2022 counterpart (GDRNPP). Since 2017, the accuracy of 6D localization of seen objects has improved by more than 50% (from 56.9 to 85.6 AR_C). The online evaluation system stays open and is available at: http://bop.felk.cvut.cz/.
- GPose2023, a submission to the BOP Challenge 2023. 2023. http://bop.felk.cvut.cz/method_info/410/.
- Learning 6D object pose estimation using 3D object coordinates. ECCV, 2014.
- Shapenet: An information-rich 3D model repository. arXiv preprint arXiv:1512.03012, 2015.
- 3d model-based zero-shot pose estimation pipeline. arXiv preprint arXiv:2305.17934, 2023.
- BlenderProc: Reducing the reality gap with photorealistic rendering. RSS Workshops, 2020.
- Blenderproc. arXiv preprint arXiv:1911.01911, 2019.
- Blenderproc2: A procedural pipeline for photorealistic rendering. Journal of Open Source Software, 8(82):4901, 2023.
- Recovering 6D object pose and predicting next-best-view in the crowd. CVPR, 2016.
- Google scanned objects: A high-quality dataset of 3D scanned household items. ICRA, 2022.
- Introducing MVTec ITODD – A dataset for 3D object recognition in industry. ICCVW, 2017.
- Model globally, match locally: Efficient and robust 3D object recognition. CVPR, 2010.
- YOLOX: Exceeding YOLO series in 2021. arXiv preprint arXiv:2107.08430, 2021.
- PointPoseNet: Accurate object detection and 6 DoF pose estimation in point clouds. arXiv preprint arXiv:1912.09057, 2019.
- Shape-constraint recurrent flow for 6d object pose estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4831–4840, 2023.
- Shape, light & material decomposition from images using monte carlo rendering and denoising. NeurIPS, 2022.
- SurfEmb: Dense and continuous correspondence distributions for object pose estimation with learnt surface embeddings. CVPR, 2022.
- Mask R-CNN. ICCV, 2017.
- Model based training, detection and pose estimation of texture-less 3D objects in heavily cluttered scenes. ACCV, 2012.
- On pre-trained image features and synthetic images for deep learning. ECCVW, 2018.
- Tomáš Hodaň. Pose estimation of specific rigid objects. PhD Thesis, Czech Technical University in Prague, 2021.
- EPOS: Estimating 6D pose of objects with symmetries. CVPR, 2020.
- BOP Challenge 2019. https://bop.felk.cvut.cz/media/bop_challenge_2019_results.pdf, 2019.
- T-LESS: An RGB-D dataset for 6D pose estimation of texture-less objects. WACV, 2017.
- On evaluation of 6D object pose estimation. ECCVW, 2016.
- BOP: Benchmark for 6D object pose estimation. ECCV, 2018.
- BOP Challenge 2020 on 6D object localization. ECCV, 2020.
- Photorealistic image synthesis for object instance detection. ICIP, 2019.
- Perspective flow aggregation for data-limited 6d object pose estimation. arXiv preprint arXiv:2203.09836, 2022.
- Ultralytics YOLO, Jan. 2023.
- HomebrewedDB: RGB-D dataset for 6D pose estimation of 3D objects. ICCVW, 2019.
- SSD-6D: Making RGB-based 3D detection and 6D pose estimation great again. ICCV, 2017.
- A hybrid approach for 6dof pose estimation. ECCVW, 2020.
- CosyPose: Consistent multi-view multi-object 6D pose estimation. ECCV, 2020.
- MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare. In CoRL, 2022.
- CDPN: Coordinates-based disentangled pose network for real-time rgb-based 6-dof object pose estimation. ICCV, 2019.
- Sam-6d: Segment anything model meets zero-shot 6d object pose estimation. arXiv preprint arXiv:2311.15707, 2023.
- Microsoft COCO: Common objects in context. ECCV, 2014.
- Coupled iterative refinement for 6d multi-object pose estimation. In CVPR, 2022.
- Leaping from 2D detection to efficient 6DoF object pose estimation. ECCVW, 2020.
- GDRNPP. https://github.com/shanice-l/gdrnpp_bop2022, 2022.
- GenFlow, a submission to the BOP Challenge 2023. Unpublished, 2023. http://bop.felk.cvut.cz/method_info/437/.
- Extracting triangular 3d models, materials, and lighting from images. CVPR, 2022.
- KinectFusion: Real-time dense surface mapping and tracking. ISMAR, 2011.
- CNOS: A Strong Baseline for CAD-based Novel Object Segmentation. In ICCVW, 2023.
- Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions. In CVPR, 2022.
- Dinov2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193, 2023.
- Pix2Pose: Pixel-wise coordinate regression of objects for 6D pose estimation. ICCV, 2019.
- Geotransformer: Fast and robust point cloud registration with geometric transformer. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023.
- Using 2 point+normal sets for fast registration of point clouds with small overlap. ICRA, 2017.
- Common objects in 3D: Large-scale learning and evaluation of real-life 3D category reconstruction. In ICCV, 2021.
- A dataset for improved RGBD-based object detection and pose estimation for warehouse pick-and-place. RA-L, 2016.
- Deep segmentation leverages geometric pose estimation in computer-aided total knee arthroplasty. Healthcare Technology Letters, 2019.
- Osop: A multi-stage one shot object pose estimation framework. In CVPR, 2022.
- ZebraPose: Coarse to fine surface encoding for 6DoF object pose estimation. CVPR, 2022.
- Multi-path learning for object pose estimation across domains. CVPR, 2020.
- BOP challenge 2022 on detection, segmentation and pose estimation of specific rigid objects. CVPRW, 2023.
- Augmented Autoencoders: Implicit 3D orientation learning for 6D object detection. IJCV, 2019.
- Latent-class hough forests for 3D object detection and pose estimation. ECCV, 2014.
- 6-DoF pose estimation of household objects for robotic manipulation: An accessible dataset and benchmark. IROS, 2022.
- Ref-NeRF: Structured view-dependent appearance for neural radiance fields. In CVPR, 2022.
- A method for 6D pose estimation of free-form rigid objects using point pair features on range data. Sensors, 2018.
- GDR-Net: Geometry-guided direct regression network for monocular 6D object pose estimation. CVPR, 2021.
- Full 3D reconstruction of transparent objects. ACM TOG, 2018.
- Keypoint cascade voting for point cloud based 6DoF pose estimation. arXiv preprint arXiv:2210.08123, 2022.
- PoseCNN: A convolutional neural network for 6D object pose estimation in cluttered scenes. RSS, 2018.
- TEASER: Fast and Certifiable Point Cloud Registration. IEEE Trans. Robotics, 2020.
- DPOD: 6D pose object detector and refiner. ICCV, 2019.
- Tomas Hodan (22 papers)
- Martin Sundermeyer (18 papers)
- Van Nguyen Nguyen (9 papers)
- Gu Wang (25 papers)
- Eric Brachmann (27 papers)
- Bertram Drost (8 papers)
- Vincent Lepetit (101 papers)
- Carsten Rother (74 papers)
- Jiri Matas (133 papers)
- Yann Labbe (4 papers)