Visual-Inertial Navigation: A Concise Review (1906.02650v1)

Published 6 Jun 2019 in cs.RO

Abstract: As inertial and visual sensors are becoming ubiquitous, visual-inertial navigation systems (VINS) have prevailed in a wide range of applications from mobile augmented reality to aerial navigation to autonomous driving, in part because of the complementary sensing capabilities and the decreasing costs and size of the sensors. In this paper, we survey thoroughly the research efforts taken in this field and strive to provide a concise but complete review of the related work -- which is unfortunately missing in the literature while being greatly demanded by researchers and engineers -- in the hope to accelerate the VINS research and beyond in our society as a whole.

Authors (1)

Guoquan Huang (32 papers)

Citations (264)

View on Semantic Scholar

Summary

The paper provides a comprehensive review of visual-inertial navigation by evaluating filtering and optimization algorithms for robust state estimation.
It demonstrates that tightly-coupled sensor fusion and precise calibration of camera-IMU systems are essential for accurate navigation in GPS-denied environments.
The review outlines future research directions, including deep learning and advanced sensor technologies, to overcome persistent localization challenges.

A Review of Visual-Inertial Navigation Systems

The paper "Visual-Inertial Navigation: A Concise Review" by Guoquan (Paul) Huang presents a comprehensive analysis of visual-inertial navigation systems (VINS), which have gained considerable traction in various applications such as mobile augmented reality, autonomous driving, and aerial navigation. The integration of inertial and visual sensors, facilitated by advancements in sensor technology and cost reduction, has enabled the development of robust navigation systems capable of operating in GPS-denied environments. This paper provides a critical review of the current state of VINS research, highlighting key methodologies, challenges, and future directions.

The paper is structured systematically, beginning with an introduction to the foundational concepts of inertial navigation systems (INS). These systems estimate the six degrees of freedom (6DOF) poses using data from inertial measurement units (IMUs). One primary challenge with INS is the degradation of pose estimates over time due to sensor noise and bias. VINS address this challenge by integrating visual data from cameras, which, despite their own limitations, provide complementary information to stabilize and enhance INS estimates.

Key components of VINS are explored in detail, particularly the state estimation algorithms used to fuse IMU measurements with visual data. Two main approaches are discussed: filtering-based methods, such as the extended Kalman filter (EKF), and optimization-based methods, which offer the advantage of relinearizing measurements to minimize errors at the cost of higher computational demands. The development of the multi-state constraint Kalman filter (MSCKF) and other variants like the square-root inverse sliding window filter (SR-ISWF) demonstrate the iterative evolution of algorithms aimed at improving efficiency and consistency.

The paper distinguishes between tightly-coupled and loosely-coupled sensor fusion methods, with tightly-coupled approaches achieving higher accuracy by processing visual and inertial data in a unified manner. It further explores the variations between visual-inertial odometry (VIO) and VI-SLAM frameworks. While VIO focuses on short-term accuracy without global map considerations, VI-SLAM incorporates feature mapping and loop-closure techniques to minimize drift over long trajectories.

Significant emphasis is placed on sensor calibration, both spatially and temporally, to ensure precise coordination between camera and IMU data streams. This calibration is crucial for accurate VINS operation, given that even small misalignments can lead to substantial estimation errors. The calibration can be performed offline using specialized tools such as the Kalibr toolbox, or online, which provides flexibility in dynamic environments.

The review concludes with a discussion on challenges and future research directions in VINS. Persistent localization, especially in large and dynamic environments, remains a critical hurdle. Emerging technologies such as deep learning and advanced sensor modalities, including event cameras and LiDARs, offer promising avenues for future VINS improvements. Semantic understanding and cooperative navigation are identified as transformative capabilities that could significantly enhance VINS applications.

Overall, the paper by Huang provides an insightful synthesis of the current landscape of visual-inertial navigation systems, serving as a crucial resource for researchers and engineers seeking to develop and refine VINS technologies. Through meticulous exploration of methodologies and latent challenges, this review underscores both the accomplishments and the potential growth areas within this dynamic field.

PDF Markdown

Visual-Inertial Navigation: A Concise Review (1906.02650v1)

Summary

A Review of Visual-Inertial Navigation Systems

Related Papers