- The paper presents a novel multimodal dataset with 13,556 LiDAR scans and 6,235 images designed for off-road semantic segmentation challenges.
- The evaluation shows that state-of-the-art models achieve mIoU scores of 52.92% on images and 43.07% on LiDAR point clouds amid significant class imbalance.
- The dataset’s integration of diverse sensor data paves the way for advancing autonomous navigation in complex, unstructured off-road environments.
Analysis of the RELLIS-3D Dataset for Off-Road Autonomous Navigation
The paper "RELLIS-3D Dataset: Data, Benchmarks and Analysis" presents a novel dataset designed to enhance the semantic understanding required for autonomous navigation in off-road environments. The authors introduce RELLIS-3D, a comprehensive multimodal dataset, addressing the scarcity of resources that support semantic segmentation in challenging non-urban settings.
Dataset Composition and Characteristics
RELLIS-3D is distinguished by its diverse collection of 13,556 LiDAR scans and 6,235 images, annotated for semantic segmentation tasks. The dataset fills a crucial gap by providing multimodal data representative of off-road conditions, countering the urban-centric focus of existing datasets. Core components include RGB camera images, LiDAR point clouds, stereo images, high-precision GPS measurements, and inertial data. Synchronization of these sensor modalities via the Precision Time Protocol (PTP) ensures coherent multimodal data, facilitating advanced algorithm development.
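In practice, working with time-synchronized streams like these typically means pairing each camera frame with the nearest LiDAR scan by timestamp. A minimal sketch of such nearest-timestamp matching (the function name, file layout, and timestamps are illustrative assumptions, not part of the dataset's actual tooling):

```python
import bisect

def match_nearest(lidar_ts, image_ts):
    """For each image timestamp, return the index of the closest LiDAR timestamp.

    Assumes lidar_ts is sorted ascending (a hypothetical pre-sorted stream).
    """
    pairs = []
    for t in image_ts:
        i = bisect.bisect_left(lidar_ts, t)
        # Compare the neighbors on either side of the insertion point.
        candidates = [j for j in (i - 1, i) if 0 <= j < len(lidar_ts)]
        best = min(candidates, key=lambda j: abs(lidar_ts[j] - t))
        pairs.append(best)
    return pairs

# Example: an image at t=0.95 pairs with the LiDAR scan at t=1.0 (index 1).
print(match_nearest([0.0, 1.0, 2.0], [0.95, 1.9]))  # [1, 2]
```

Because PTP keeps all sensor clocks on a common time base, this simple nearest-neighbor association is usually sufficient; without synchronization, per-sensor clock offsets would have to be estimated first.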
The dataset's annotations include pixel-wise labels for images and point-wise labels for LiDAR data. The labeling ontology is expansive, covering 20 distinct classes that encapsulate the complexity of off-road terrains and objects. Notably, the class distribution exhibits severe imbalance, a characteristic of off-road data that poses unique challenges compared to more structured urban environments.
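Class imbalance of this kind is straightforward to quantify by histogramming label IDs across the annotation files. A hedged sketch (only `NUM_CLASSES = 20` follows the ontology described above; the helper name and toy inputs are hypothetical):

```python
import numpy as np

NUM_CLASSES = 20  # per the 20-class ontology described above

def class_frequencies(label_arrays):
    """Count how often each class ID occurs across a set of label maps.

    label_arrays: iterable of integer arrays (pixel-wise or point-wise labels).
    Returns per-class counts normalized to frequencies summing to 1.
    """
    counts = np.zeros(NUM_CLASSES, dtype=np.int64)
    for labels in label_arrays:
        counts += np.bincount(labels.ravel(), minlength=NUM_CLASSES)[:NUM_CLASSES]
    return counts / counts.sum()

# Toy example: two tiny "label maps" dominated by class 0.
toy = [np.array([[0, 0], [0, 3]]), np.array([[0, 0], [5, 0]])]
freq = class_frequencies(toy)
print(freq[0])  # 0.75 -- one class dominates, mirroring the imbalance issue
```

On real off-road data the dominant ground classes (e.g., grass-like terrain) typically account for most of the mass of this histogram, which is exactly the imbalance the benchmarks below must contend with.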
Benchmark Evaluation
The authors conduct comprehensive experiments to assess the performance of state-of-the-art semantic segmentation models for both image and point-cloud processing. The image-based methods evaluated are HRNETV2+OCR and Gated-SCNN, while SalsaNext and KPConv are tested on point clouds. Results indicate a significant drop in performance relative to urban scene datasets, with mIoU scores of 52.92% for HRNETV2+OCR on images and 43.07% for SalsaNext on point clouds.
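Mean IoU, the metric behind the numbers above, is computed per class from a confusion matrix and then averaged over the classes present. A self-contained sketch of the standard computation:

```python
import numpy as np

def mean_iou(conf):
    """Compute mean intersection-over-union from a confusion matrix.

    conf[i, j] = number of samples with ground-truth class i predicted as j.
    Classes with an empty union (never observed or predicted) are excluded.
    """
    conf = np.asarray(conf, dtype=np.float64)
    tp = np.diag(conf)                                   # true positives
    union = conf.sum(axis=0) + conf.sum(axis=1) - tp     # TP + FP + FN
    valid = union > 0
    return (tp[valid] / union[valid]).mean()

# Two-class toy example:
# class 0: TP=4, FP=1, FN=1 -> IoU = 4/6; class 1: TP=2, FP=1, FN=1 -> IoU = 2/4
conf = np.array([[4, 1],
                 [1, 2]])
print(mean_iou(conf))  # (4/6 + 2/4) / 2 = 0.5833...
```

With 20 classes and a heavily skewed distribution, a model can label the dominant terrain classes well yet still score a low mIoU, since every class contributes equally to the mean; this is one reason the off-road scores sit well below typical urban benchmarks.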
The degradation in model performance is primarily attributed to severe class imbalance and the unstructured nature of off-road environments. Visually challenging scenarios, such as thick vegetation and varying terrain types, further complicate semantic segmentation. Images offer rich texture cues, while LiDAR provides the depth information critical for detecting and navigating obstacles, underscoring the importance of multimodal integration.
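One common mitigation for such imbalance (not a method prescribed by the paper) is to weight the training loss by inverse class frequency so that rare classes contribute more to the gradient. A minimal sketch, where the smoothing constant `eps` is an assumed hyperparameter:

```python
import numpy as np

def inverse_frequency_weights(freq, eps=0.02):
    """Per-class loss weights proportional to 1 / (frequency + eps).

    freq: per-class frequencies summing to 1. The eps term damps the
    weight of extremely rare classes so they do not dominate the loss.
    Weights are normalized to average 1 across classes.
    """
    freq = np.asarray(freq, dtype=np.float64)
    w = 1.0 / (freq + eps)
    return w / w.sum() * len(w)

# A frequent class gets a small weight, a rare class a large one.
w = inverse_frequency_weights([0.9, 0.09, 0.01])
print(w[0] < w[2])  # True
```

The resulting vector can be passed as the per-class weight argument of a standard weighted cross-entropy loss; the design trade-off is that aggressive up-weighting of rare classes can increase false positives on the dominant terrain classes.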
Implications and Future Directions
The implications of the RELLIS-3D dataset extend to both practical applications and theoretical advancements. Practically, the dataset provides a robust platform for developing and refining algorithms that improve autonomous vehicles' effectiveness in off-road conditions. Theoretically, it challenges the robustness of existing models, prompting a reevaluation of strategies for handling class imbalance and integrating multimodal data.
Future research directions include expanding the dataset to incorporate diverse off-road environments and enhancing semantic labels with attributes like traversability and terrain features. Such expansions could improve dataset generality and provide richer context for navigation. Moreover, leveraging the dataset's high-precision GPS and stereo capabilities could foster new opportunities in odometry and SLAM applications, with potential for integrating semantic understanding to enhance navigation performance.
Overall, the RELLIS-3D dataset is positioned as a substantial contribution to the field of autonomous navigation, providing critical resources to push the boundaries of current research in complex, unstructured environments.