Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting

Published 13 Sep 2021 in cs.CV | (2109.06061v2)

Abstract: In this work, we address the problem of jointly estimating albedo, normals, depth and 3D spatially-varying lighting from a single image. Most existing methods formulate the task as image-to-image translation, ignoring the 3D properties of the scene. However, indoor scenes contain complex 3D light transport where a 2D representation is insufficient. In this paper, we propose a unified, learning-based inverse rendering framework that formulates 3D spatially-varying lighting. Inspired by classic volume rendering techniques, we propose a novel Volumetric Spherical Gaussian representation for lighting, which parameterizes the exitant radiance of the 3D scene surfaces on a voxel grid. We design a physics based differentiable renderer that utilizes our 3D lighting representation, and formulates the energy-conserving image formation process that enables joint training of all intrinsic properties with the re-rendering constraint. Our model ensures physically correct predictions and avoids the need for ground-truth HDR lighting which is not easily accessible. Experiments show that our method outperforms prior works both quantitatively and qualitatively, and is capable of producing photorealistic results for AR applications such as virtual object insertion even for highly specular objects.

Abstract PDF Upgrade to Chat

Citations (70)

View on Semantic Scholar

Summary

The paper introduces a novel volumetric spherical Gaussian model that captures 3D spatially-varying indoor lighting for accurate inverse rendering.
It employs a differentiable ray tracing renderer using energy conservation to predict lighting without requiring HDR ground truths.
Experimental results show improved scale-invariant error metrics and angular accuracy, enabling photorealistic augmented reality applications.

Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting

The paper explores the intricate problem of inverse rendering within indoor environments, addressing the estimation of intrinsic scene properties such as albedo, normals, depth, and lighting, from a single image. The novelty lies in the development of a framework that shifts the paradigm from traditional image-to-image translation approaches to a more comprehensive 3D spatially-varying representation, incorporating both high dynamic range (HDR) and spatial light variance.

The authors introduce a sophisticated Volumetric Spherical Gaussian (VSG) model to encapsulate the lighting dynamics of 3D scenes. This approach surpasses conventional spherical harmonics and 2D spatially-varying representations, enabling detailed capture of view-dependent lighting variations. With a differentiable ray tracing renderer, the model adopts energy-conservation principles to derive physically-accurate lighting predictions without reliance on inaccessible HDR ground truths. The framework synergizes direct and joint prediction modules to iteratively refine intrinsic properties, ensuring spatial coherence and realistic renditions vital for applications in augmented reality (AR).

Experiments affirm the model's superior performance over established methods, yielding photorealistic renderings for AR tasks. Key numerical results include enhanced scale-invariant mean squared error and more accurate angular error metrics for surface normals, showcasing the framework's proficient delineation of scene characteristics and lighting effects. Notably, the approach facilitates AR applications like virtual object insertion, adeptly supporting complex indoor light transport.

The implications are profound, suggesting new pathways in inverse rendering research, particularly pushing towards enriched interpretations of scene illumination. Future progress may see extensions into multi-view systems and real-time applications, granted the foundational aspects established here.

Markdown

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Paper Prompts

Top Community Prompts

Explain it Like I'm 14

off on

Knowledge Gaps

off on

Practical Applications

off on

Glossary

off on

Conceptual Simplification

off on

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Generate Now

Continue Learning

Authors (4)

Collections

YouTube

Show All Videos

Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting

Summary

Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting

Paper to Video (Beta)

Whiteboard

Paper Prompts

Top Community Prompts

Open Problems

Continue Learning

Authors (4)

Collections

YouTube

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research

Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting

Summary

Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting

Paper to Video (Beta)

Whiteboard

Paper Prompts

Top Community Prompts

Open Problems

Continue Learning

Related Papers

Authors (4)

Collections

YouTube

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research