Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multiple View Performers for Shape Completion (2209.06291v1)

Published 13 Sep 2022 in cs.CV and cs.RO

Abstract: We propose the Multiple View Performer (MVP) - a new architecture for 3D shape completion from a series of temporally sequential views. MVP accomplishes this task by using linear-attention Transformers called Performers. Our model allows the current observation of the scene to attend to the previous ones for more accurate infilling. The history of past observations is compressed via the compact associative memory approximating modern continuous Hopfield memory, but crucially of size independent from the history length. We compare our model with several baselines for shape completion over time, demonstrating the generalization gains that MVP provides. To the best of our knowledge, MVP is the first multiple view voxel reconstruction method that does not require registration of multiple depth views and the first causal Transformer based model for 3D shape completion.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. David Watkins (12 papers)
  2. Peter Allen (48 papers)
  3. Krzysztof Choromanski (96 papers)
  4. Jacob Varley (14 papers)
  5. Nicholas Waytowich (26 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.