Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Region-Transformer: Self-Attention Region Based Class-Agnostic Point Cloud Segmentation (2403.01407v1)

Published 3 Mar 2024 in cs.CV, cs.AI, and cs.RO

Abstract: Point cloud segmentation, which helps us understand the environment of specific structures and objects, can be performed in class-specific and class-agnostic ways. We propose a novel region-based transformer model called Region-Transformer for performing class-agnostic point cloud segmentation. The model utilizes a region-growth approach and self-attention mechanism to iteratively expand or contract a region by adding or removing points. It is trained on simulated point clouds with instance labels only, avoiding semantic labels. Attention-based networks have succeeded in many previous methods of performing point cloud segmentation. However, a region-growth approach with attention-based networks has yet to be used to explore its performance gain. To our knowledge, we are the first to use a self-attention mechanism in a region-growth approach. With the introduction of self-attention to region-growth that can utilize local contextual information of neighborhood points, our experiments demonstrate that the Region-Transformer model outperforms previous class-agnostic and class-specific methods on indoor datasets regarding clustering metrics. The model generalizes well to large-scale scenes. Key advantages include capturing long-range dependencies through self-attention, avoiding the need for semantic labels during training, and applicability to a variable number of objects. The Region-Transformer model represents a promising approach for flexible point cloud segmentation with applications in robotics, digital twinning, and autonomous vehicles.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (35)
  1. Projection-based point convolution for efficient point cloud segmentation. IEEE Access, 10:15348–15358.
  2. Joint 2d-3d-semantic data for indoor scene understanding. ArXiv e-prints.
  3. Curvature of point clouds through principal component analysis.
  4. Lrgnet: Learnable region growing for class-agnostic point cloud segmentation. IEEE Robotics and Automation Letters, 6(2):2799–2806.
  5. 3d point cloud processing and learning for autonomous driving: Impacting map creation, localization, and perception. IEEE Signal Processing Magazine, 38(1):68–86.
  6. Scannet: Richly-annotated 3d reconstructions of indoor scenes. In Proc. Computer Vision and Pattern Recognition (CVPR). IEEE.
  7. Segmentation of building point cloud models including detailed architectural/structural features and mep systems. Automation in Construction, 51(C):32–45. Publisher Copyright: © 2014 Elsevier B.V. All rights reserved.
  8. Know what your neighbors do: 3d semantic segmentation of point clouds, page 395–409. Springer International Publishing.
  9. Feature extraction from point clouds. In International Meshing Roundtable Conference.
  10. A comprehensive performance evaluation of 3d local feature descriptors. International Journal of Computer Vision, 116.
  11. Deep learning for 3d point clouds: A survey.
  12. Gyawali, D. (2023). Lrtransformer: Learn-region transformer for object-agnostic point cloud segmentation. Master’s thesis, Louisiana State University.
  13. Learning to optimally segment point clouds.
  14. Research on improved region growing point cloud algorithm. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLII-3/W10:153–157.
  15. Research and application of semantic point cloud on indoor robots. In 2021 5th International Conference on Communication and Information Systems (ICCIS), pages 108–113.
  16. Automatic generation of structural geometric digital twins from point clouds. Sci Rep, 12:22321.
  17. 3d point cloud segmentation: A survey. In 2013 6th IEEE Conference on Robotics, Automation and Mechatronics (RAM), page 225–230.
  18. 3d point cloud clustering with learnable robust geometric constraints.
  19. Jsis3d: Joint semantic-instance segmentation of 3d point clouds with multi-task pointwise networks and multi-value conditional random fields.
  20. Low-cost augmented reality systems via 3d point cloud sensors. In 2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems, pages 188–192.
  21. Pointnet: Deep learning on point sets for 3d classification and segmentation.
  22. Pointnet++: Deep hierarchical feature learning on point sets in a metric space.
  23. Segmentation of point clouds using smoothness constraints. In Maas, H. and Schneider, D., editors, ISPRS 2006 : Proceedings of the ISPRS commission V symposium, volume 35, pages 248–253. International Society for Photogrammetry and Remote Sensing (ISPRS). ISPRS commission V symposium : image.
  24. Fast point feature histograms (fpfh) for 3d registration. In 2009 IEEE International Conference on Robotics and Automation, pages 3212–3217.
  25. Class-agnostic segmentation loss and its application to salient object detection and segmentation.
  26. Attention is all you need.
  27. Curvature and density based feature point detection for point cloud data. In IET 3rd International Conference on Wireless, Mobile and Multimedia Networks (ICWMNN 2010), page 377–380.
  28. Unsupervised point cloud representation learning with deep neural networks: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 1–20.
  29. Unseen object instance segmentation for robotic environments.
  30. Learning object bounding boxes for 3d instance segmentation on point clouds.
  31. Exploring self-attention for image recognition.
  32. Point transformer.
  33. Few-shot 3d point cloud semantic segmentation.
  34. Robust normal estimation for 3d lidar point clouds in urban environments. Sensors, 19(5).
  35. Fast and accurate normal estimation for point cloud via patch stitching.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Dipesh Gyawali (6 papers)
  2. Jian Zhang (543 papers)
  3. BB Karki (1 paper)

Summary

We haven't generated a summary for this paper yet.