Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis (2312.16812v2)

Published 28 Dec 2023 in cs.CV and cs.GR

Abstract: Novel view synthesis of dynamic scenes has been an intriguing yet challenging problem. Despite recent advancements, simultaneously achieving high-resolution photorealistic results, real-time rendering, and compact storage remains a formidable task. To address these challenges, we propose Spacetime Gaussian Feature Splatting as a novel dynamic scene representation, composed of three pivotal components. First, we formulate expressive Spacetime Gaussians by enhancing 3D Gaussians with temporal opacity and parametric motion/rotation. This enables Spacetime Gaussians to capture static, dynamic, as well as transient content within a scene. Second, we introduce splatted feature rendering, which replaces spherical harmonics with neural features. These features facilitate the modeling of view- and time-dependent appearance while maintaining small size. Third, we leverage the guidance of training error and coarse depth to sample new Gaussians in areas that are challenging to converge with existing pipelines. Experiments on several established real-world datasets demonstrate that our method achieves state-of-the-art rendering quality and speed, while retaining compact storage. At 8K resolution, our lite-version model can render at 60 FPS on an Nvidia RTX 4090 GPU. Our code is available at https://github.com/oppo-us-research/SpacetimeGaussians.


Summary

  • The paper introduces a dynamic scene representation built from Spacetime Gaussians, which capture static, dynamic, and transient content via temporal opacity and parametric motion/rotation.
  • It splats compact neural features instead of traditional spherical harmonics, keeping storage small while still modeling view- and time-dependent appearance.
  • Guided sampling based on training error and coarse depth places new Gaussians in hard-to-converge regions; the lite-version model renders 8K video at 60 FPS on an Nvidia RTX 4090 GPU.

Introduction to Spacetime Gaussian Feature Splatting

Rendering photorealistic views of dynamic scenes in real time has long been a challenge in computer vision and graphics. Simultaneously achieving high resolution, real-time rendering, and compact storage is particularly demanding. Technologies that let users explore dynamic scenes from novel viewpoints are of great interest due to their applications in virtual and augmented reality, broadcasting, and education.

Innovations in Dynamic View Synthesis

This paper addresses the delicate balance between rendering quality, speed, and storage efficiency. It proposes a new dynamic scene representation, termed Spacetime Gaussian Feature Splatting, built on three components:

  1. Spacetime Gaussians: An approach that extends 3D Gaussians by incorporating temporal opacity and parametric motion/rotation into the traditional model. This allows Spacetime Gaussians to capture static and dynamic features as well as transient content, such as objects that emerge or vanish over time (see the first sketch after this list).
  2. Splatted Feature Rendering: This technique forgoes spherical harmonics and instead splats compact neural features, which are smaller in size yet expressive enough to model view- and time-dependent appearance, contributing to the model's compactness (second sketch below).
  3. Guided Sampling: The optimization process is improved by sampling new Gaussians in areas that are difficult to render well, particularly regions that are sparsely covered or far from the cameras. Sampling is guided by training error and coarse depth, enhancing rendering quality in complex scenes (third sketch below).
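To make the first component concrete, here is a minimal NumPy sketch of how a single Spacetime Gaussian could be evaluated at a query time. The exact form of the temporal opacity, the polynomial orders, and all parameter names are illustrative assumptions inferred from the abstract's description, not the authors' implementation.

```python
import numpy as np

def evaluate_spacetime_gaussian(t, mu_tau, s_tau, sigma_s, pos_coeffs, rot_coeffs):
    """Evaluate one Spacetime Gaussian's time-varying parameters at time t.

    mu_tau:     temporal center of the Gaussian (assumed parameterization)
    s_tau:      temporal decay rate (larger = shorter-lived content)
    sigma_s:    peak spatial opacity
    pos_coeffs: list of 3-vectors, polynomial coefficients of the center
    rot_coeffs: list of 4-vectors, polynomial coefficients of the quaternion
    """
    dt = t - mu_tau

    # Temporal opacity: a 1D Gaussian in time lets the primitive fade in and
    # out, which is how transient content can be represented.
    opacity = sigma_s * np.exp(-s_tau * dt**2)

    # Parametric motion: the center follows a polynomial trajectory in time.
    position = sum(c * dt**k for k, c in enumerate(pos_coeffs))

    # Parametric rotation: a polynomial in quaternion space, renormalized.
    q = sum(c * dt**k for k, c in enumerate(rot_coeffs))
    rotation = q / np.linalg.norm(q)

    return opacity, position, rotation

# Example: a Gaussian that peaks at t = 0.5 and drifts along +x.
opacity, position, rotation = evaluate_spacetime_gaussian(
    t=0.6, mu_tau=0.5, s_tau=100.0, sigma_s=0.9,
    pos_coeffs=[np.array([0.0, 0.0, 0.0]), np.array([1.0, 0.0, 0.0])],
    rot_coeffs=[np.array([1.0, 0.0, 0.0, 0.0])],
)
```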
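The second component alpha-blends (splats) low-dimensional per-Gaussian features into a per-pixel feature map and decodes it with a small network. The PyTorch sketch below shows one plausible decoder; the feature dimension, hidden width, and use of the view direction as the only conditioning input are assumptions, and the paper's actual decoder may organize its features differently (e.g., separating a base color from view- and time-dependent residuals).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FeatureDecoder(nn.Module):
    """Tiny MLP that maps a splatted feature map plus view direction to RGB."""

    def __init__(self, feat_dim: int = 9, hidden: int = 32):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim + 3, hidden),  # splatted features + view direction
            nn.ReLU(),
            nn.Linear(hidden, 3),             # RGB output
        )

    def forward(self, features: torch.Tensor, view_dirs: torch.Tensor) -> torch.Tensor:
        # features:  (H, W, feat_dim), alpha-blended per pixel by the rasterizer
        # view_dirs: (H, W, 3), unit vectors from the camera through each pixel
        x = torch.cat([features, view_dirs], dim=-1)
        return torch.sigmoid(self.mlp(x))

# Usage on a dummy 64x64 feature map with normalized view directions.
decoder = FeatureDecoder()
rgb = decoder(torch.randn(64, 64, 9),
              F.normalize(torch.randn(64, 64, 3), dim=-1))  # -> (64, 64, 3)
```

The appeal of this design is that each pixel needs only one cheap MLP evaluation after splatting, rather than per-sample network queries along a ray, while the feature vectors remain far smaller than a full set of spherical-harmonic coefficients.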
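The third component spawns new Gaussians where optimization struggles. As a hedged sketch of the idea, the function below selects high-error pixels in a rendered training view and unprojects them to 3D using a coarse depth map, producing candidate centers for new Gaussians. The error threshold, proposal cap, and selection strategy are illustrative assumptions, not the paper's exact procedure.

```python
import torch

def propose_new_gaussians(render, target, depth, K_inv, cam_to_world,
                          err_thresh=0.1, max_new=1000):
    """Return candidate 3D centers for new Gaussians in high-error regions.

    render, target: (H, W, 3) rendered and ground-truth images
    depth:          (H, W) coarse depth estimate for this view
    K_inv:          (3, 3) inverse camera intrinsics
    cam_to_world:   (4, 4) camera-to-world transform
    """
    # Per-pixel photometric error flags regions the current Gaussians miss.
    error = (render - target).abs().mean(dim=-1)
    ys, xs = torch.nonzero(error > err_thresh, as_tuple=True)

    # Cap the number of proposals so densification stays gradual.
    if ys.numel() > max_new:
        keep = torch.randperm(ys.numel())[:max_new]
        ys, xs = ys[keep], xs[keep]

    # Unproject: pixel -> camera-space ray scaled by depth -> world space.
    pix = torch.stack([xs.float(), ys.float(),
                       torch.ones_like(xs, dtype=torch.float32)], dim=-1)
    cam_pts = (K_inv @ pix.T).T * depth[ys, xs].unsqueeze(-1)
    cam_h = torch.cat([cam_pts, torch.ones(cam_pts.shape[0], 1)], dim=-1)
    return (cam_to_world @ cam_h.T).T[:, :3]
```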

State-of-the-Art Performance

Experiments on several established real-world datasets show that the representation achieves state-of-the-art rendering quality and speed while maintaining a small model size. At 8K resolution, the lite-version model renders at 60 frames per second on an Nvidia RTX 4090 GPU.

Contributions and Applications

This research presents several notable contributions:

  • A Spacetime Gaussian model that efficiently renders dynamic views with high fidelity.
  • A rendering technique based on splatted neural features rather than traditional spherical harmonics, improving the model's compactness.
  • A guided sampling method that refines rendering quality by focusing on challenging areas.
  • Extensive testing on real-world datasets demonstrating that the method surpasses the prior state of the art in rendering quality and speed while keeping model size compact.

Conclusion and Future Work

The introduction of Spacetime Gaussian Feature Splatting marks a significant advance in dynamic view synthesis. By addressing the key challenges of rendering quality, speed, and model compactness, this technology is poised to enhance user experiences across multiple applications. However, the representation is not without limitations; it currently requires multi-view video inputs and cannot be trained on-the-fly. Future explorations may include adapting the model for monocular settings and improving its training efficiency to support streaming applications.
