Physics-guided Shape-from-Template: Monocular Video Perception through Neural Surrogate Models (2311.12796v3)
Abstract: 3D reconstruction of dynamic scenes is a long-standing problem in computer graphics and increasingly difficult the less information is available. Shape-from-Template (SfT) methods aim to reconstruct a template-based geometry from RGB images or video sequences, often leveraging just a single monocular camera without depth information, such as regular smartphone recordings. Unfortunately, existing reconstruction methods are either unphysical and noisy or slow in optimization. To solve this problem, we propose a novel SfT reconstruction algorithm for cloth using a pre-trained neural surrogate model that is fast to evaluate, stable, and produces smooth reconstructions due to a regularizing physics simulation. Differentiable rendering of the simulated mesh enables pixel-wise comparisons between the reconstruction and a target video sequence that can be used for a gradient-based optimization procedure to extract not only shape information but also physical parameters such as stretching, shearing, or bending stiffness of the cloth. This allows to retain a precise, stable, and smooth reconstructed geometry while reducing the runtime by a factor of 400-500 compared to $\phi$-SfT, a state-of-the-art physics-based SfT approach.
- Large steps in cloth simulation. In Annual Conference on Computer Graphics and Interactive Techniques, page 43–54, 1998.
- Shape-from-template. IEEE TPAMI, 37(10):2099–2118, 2015.
- Pbns: Physically based neural simulator for unsupervised garment pose space deformation. CoRR, abs/2012.11310, 2020.
- Neural cloth simulation. ACM TOG, 41, 2022.
- Recovering non-rigid 3d shape from image streams. In CVPR, pages 690–696 vol.2, 2000.
- The isowarp: the template-based visual geometry of isometric surfaces. IJCV, 129(7):2194–2222, 2021.
- Yarn-level simulation of woven cloth. ACM TOG, 33(6), 2014.
- Diffpd: Differentiable projective dynamics. ACM TOG, 41(2), 2021.
- Learning-based bending stiffness parameter estimation by a drape tester. ACM TOG, 41(6), 2022.
- Multiscale meshgraphnets. In ICML Workshop, 2022.
- Texture-generic deep shape-from-template. IEEE Access, 9:75211–75230, 2021.
- Hdm-net: Monocular non-rigid 3d reconstruction with learned deformation model. In Virtual Reality and Augmented Reality, pages 51–72. Springer International Publishing, 2018.
- Fine-grained differentiable physics: a yarn-level model for fabrics. In ICLR, 2022.
- Hood: Hierarchical graphs for generalized modelling of clothing dynamics. In CVPR, pages 16965–16974, 2023.
- Neuralsim: Augmenting differentiable simulators with neural networks. In IEEE ICRA, pages 9474–9481, 2021.
- Neural implicit representations for physical parameter inference from a single video. In WACV, pages 2093–2103, 2023.
- Difftaichi: Differentiable programming for physical simulation. In ICLR, 2020.
- Pavel Iakubovskii. Segmentation models. https://github.com/qubvel/segmentation_models, 2019.
- Estimating cloth simulation parameters from a static drape using neural networks. IEEE Access, 8:195113–195121, 2020.
- ϕitalic-ϕ\phiitalic_ϕ-SfT: Shape-from-Template with a Physics-Based Deformation Model. In CVPR, pages 3948–3958, 2022.
- Neuralclothsim: Neural deformation fields meet the kirchhoff-love thin shell theory. arXiv:2308.12970, 2023.
- Adam: A method for stochastic optimization. In ICLR, 2014.
- Modular primitives for high-performance differentiable rendering. ACM TOG, 39(6), 2020.
- Efficient cloth simulation using miniature cloth and upscaling deep neural networks. CoRR, 2019.
- Diffcloth: Differentiable cloth simulation with dry frictional contact. ACM TOG, 42(1), 2022.
- Differentiable cloth simulation for inverse problems. In NeurIPS. Curran Associates, Inc., 2019.
- Miles Macklin. Warp: A high-performance python framework for gpu simulation and graphics. https://github.com/nvidia/warp, 2022. NVIDIA GPU Technology Conference (GTC).
- Physical simulation layer for accurate 3d modeling. In CVPR, pages 13514–13523, 2022.
- Adaptive anisotropic remeshing for cloth simulation. ACM TOG, 31(6), 2012.
- Dense image registration and deformable surface reconstruction in presence of occlusions and minimal texture. In ICCV, 2015.
- Hierarchical cloth simulation using deep neural networks. In CGI, page 139–146. ACM, 2018.
- Local non-rigid structure-from-motion from diffeomorphic mappings. In CVPR, 2020.
- Learning mesh-based simulation with graph networks. In ICLR, 2020.
- Geometry-aware network for non-rigid shape prediction from a single view. In CVPR, 2018.
- Scalable differentiable physics for learning and control. In ICML, pages 7847–7856, 2020.
- How will it drape like? capturing fabric mechanics from depth images. Computer Graphics Forum, 42(2):149–160, 2023.
- U-net: Convolutional networks for biomedical image segmentation. In MICCAI, pages 234–241. Springer International Publishing, 2015.
- Deformable surface tracking ambiguities. In CVPR, pages 1–8, 2007.
- Self-supervised collision handling via generative 3d garment models for virtual try-on. In CVPR, pages 11763–11773, 2021.
- Snug: Self-supervised neural dynamic garments. In CVPR, pages 8140–8150, 2022.
- Ismo-gan: Adversarial learning for monocular non-rigid 3d reconstruction. In CVPRW, 2019.
- Neural dense non-rigid structure from motion with latent space constraints. In ECCV, 2020.
- John M. Sullivan. Curves of Finite Total Curvature, pages 137–161. Birkhäuser Basel, Basel, 2008.
- Diffcloud: Real-to-sim from point clouds with differentiable simulation and rendering of deformable objects. In IROS, pages 10828–10835, 2022.
- State of the art in dense monocular non-rigid 3d reconstruction. Comput. Graph. Forum, 42(2):485–520, 2023.
- 3d human pose estimation via intuitive physics. In CVPR, pages 4713–4725, 2023.
- Unsupervised deep learning of incompressible fluid dynamics. CoRR, abs/2006.08762, 2020.
- Interdiff: Generating 3d human-object interactions with physics-informed diffusion. In ICCV, pages 14928–14940, 2023.
- Ppr: Physically plausible reconstruction from monocular videos. In ICCV, pages 3914–3924, 2023.
- Learning-based cloth material recovery from video. In ICCV, 2017.
- Direct, dense, and deformable: Template-based non-rigid 3d reconstruction from rgb video. In ICCV, 2015.
- Physdiff: Physics-guided human motion diffusion model. In ICCV, pages 16010–16021, 2023.