RAFT-MSF: Self-Supervised Monocular Scene Flow using Recurrent Optimizer (2205.01568v1)

Published 3 May 2022 in cs.CV, cs.LG, and cs.RO

Abstract: Learning scene flow from a monocular camera still remains a challenging task due to its ill-posedness as well as lack of annotated data. Self-supervised methods demonstrate learning scene flow estimation from unlabeled data, yet their accuracy lags behind (semi-)supervised methods. In this paper, we introduce a self-supervised monocular scene flow method that substantially improves the accuracy over the previous approaches. Based on RAFT, a state-of-the-art optical flow model, we design a new decoder to iteratively update 3D motion fields and disparity maps simultaneously. Furthermore, we propose an enhanced upsampling layer and a disparity initialization technique, which overall further improves accuracy up to 7.2%. Our method achieves state-of-the-art accuracy among all self-supervised monocular scene flow methods, improving accuracy by 34.2%. Our fine-tuned model outperforms the best previous semi-supervised method with 228 times faster runtime. Code will be publicly available.

Authors (3)

Bayram Bayramli (5 papers)
Junhwa Hur (20 papers)
Hongtao Lu (76 papers)

Citations (7)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

RAFT-MSF: Self-Supervised Monocular Scene Flow using Recurrent Optimizer (2205.01568v1)

Summary

Related Papers