Region-Transformer: Self-Attention Region Based Class-Agnostic Point Cloud Segmentation (2403.01407v1)
Abstract: Point cloud segmentation, which helps us understand the structures and objects in an environment, can be performed in class-specific or class-agnostic ways. We propose a novel region-based transformer model, Region-Transformer, for class-agnostic point cloud segmentation. The model combines a region-growth approach with a self-attention mechanism to iteratively expand or contract a region by adding or removing points. It is trained on simulated point clouds with instance labels only, so no semantic labels are required. Attention-based networks have been successful in many earlier point cloud segmentation methods, but they have not yet been combined with a region-growth approach to test whether doing so improves performance. To our knowledge, we are the first to use a self-attention mechanism in a region-growth approach. Our experiments show that introducing self-attention into region growth, where it can exploit the local contextual information of neighboring points, allows Region-Transformer to outperform previous class-agnostic and class-specific methods on indoor datasets in terms of clustering metrics. The model also generalizes well to large-scale scenes. Key advantages include capturing long-range dependencies through self-attention, avoiding the need for semantic labels during training, and applicability to a variable number of objects. Region-Transformer is a promising approach for flexible point cloud segmentation, with applications in robotics, digital twinning, and autonomous vehicles.
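The iterative add/remove loop described in the abstract can be illustrated with a toy sketch. This is not the Region-Transformer architecture: the learned self-attention network is replaced by a hand-written attention-style weighting (a softmax over negative squared distances), and every name, threshold, and data point below (`attend`, `grow_region`, `radius`, `add_thr`, `remove_thr`, `tau`) is an assumption made for illustration only.

```python
import numpy as np


def attend(queries, keys, tau=0.01):
    """Softmax attention of each query over the keys (values = keys).

    Logits are negative squared distances: a hand-picked stand-in for
    the learned query/key projections of a real self-attention layer.
    """
    d2 = ((queries[:, None, :] - keys[None, :, :]) ** 2).sum(axis=2)
    logits = -d2 / tau
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    w = np.exp(logits)
    w /= w.sum(axis=1, keepdims=True)
    return w @ keys  # one context vector per query


def grow_region(points, seed, radius=0.3, add_thr=0.18,
                remove_thr=0.25, max_steps=100):
    """Grow one region from `seed`, adding and removing points each step."""
    region = {seed}
    for _ in range(max_steps):
        idx = np.array(sorted(region))
        inliers = points[idx]

        # Candidates: points outside the region but within `radius` of it.
        d = np.linalg.norm(points[:, None, :] - inliers[None, :, :], axis=2)
        near = np.flatnonzero(d.min(axis=1) <= radius)
        cand = [i for i in near if i not in region]

        # Add candidates that lie close to their attended region context.
        to_add = set()
        if cand:
            ctx = attend(points[cand], inliers)
            gap = np.linalg.norm(points[cand] - ctx, axis=1)
            to_add = {c for c, g in zip(cand, gap) if g <= add_thr}

        # Remove inliers (never the seed) that lie far from the context
        # formed by the other inliers.
        to_remove = set()
        for i in idx:
            if i == seed or len(idx) < 2:
                continue
            ctx = attend(points[[i]], points[idx[idx != i]])[0]
            if np.linalg.norm(points[i] - ctx) > remove_thr:
                to_remove.add(i)

        if not to_add and not to_remove:
            break  # region is stable
        region = (region | to_add) - to_remove
    return sorted(int(i) for i in region)


# Toy scene: a 3x3 patch, one stray point nearby, and a distant second patch.
grid = np.array([[x, y, 0.0] for x in (0.0, 0.1, 0.2) for y in (0.0, 0.1, 0.2)])
stray = np.array([[0.45, 0.1, 0.0]])
points = np.vstack([grid, stray, grid + [5.0, 0.0, 0.0]])

segment = grow_region(points, seed=0)
print(segment)
```

In this toy scene the stray point falls inside the candidate radius once the region covers the first patch, but its attended context is too far away for it to be added. In a learned model, the add and remove decisions would come from the trained network instead of fixed thresholds.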
Authors: Dipesh Gyawali, Jian Zhang, BB Karki