Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Dimensions of Motion: Monocular Prediction through Flow Subspaces (2112.01502v4)

Published 2 Dec 2021 in cs.CV

Abstract: We introduce a way to learn to estimate a scene representation from a single image by predicting a low-dimensional subspace of optical flow for each training example, which encompasses the variety of possible camera and object movement. Supervision is provided by a novel loss which measures the distance between this predicted flow subspace and an observed optical flow. This provides a new approach to learning scene representation tasks, such as monocular depth prediction or instance segmentation, in an unsupervised fashion using in-the-wild input videos without requiring camera poses, intrinsics, or an explicit multi-view stereo step. We evaluate our method in multiple settings, including an indoor depth prediction task where it achieves comparable performance to recent methods trained with more supervision.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Richard Strong Bowen (8 papers)
  2. Richard Tucker (24 papers)
  3. Ramin Zabih (19 papers)
  4. Noah Snavely (86 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.