Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Sensor Fusion by Spatial Encoding for Autonomous Driving (2308.10707v1)

Published 17 Aug 2023 in cs.CV, cs.AI, cs.LG, and cs.RO

Abstract: Sensor fusion is critical to perception systems for task domains such as autonomous driving and robotics. Recently, the Transformer integrated with CNN has demonstrated high performance in sensor fusion for various perception tasks. In this work, we introduce a method for fusing data from camera and LiDAR. By employing Transformer modules at multiple resolutions, proposed method effectively combines local and global contextual relationships. The performance of the proposed method is validated by extensive experiments with two adversarial benchmarks with lengthy routes and high-density traffics. The proposed method outperforms previous approaches with the most challenging benchmarks, achieving significantly higher driving and infraction scores. Compared with TransFuser, it achieves 8% and 19% improvement in driving scores for the Longest6 and Town05 Long benchmarks, respectively.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Quoc-Vinh Lai-Dang (4 papers)
  2. Jihui Lee (5 papers)
  3. Bumgeun Park (6 papers)
  4. Dongsoo Har (34 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.