Monocular Dynamic Gaussian Splatting: Fast, Brittle, and Scene Complexity Rules

TMLR Paper 3932 Authors

09 Jan 2025 (modified: 04 Mar 2025) · Under review for TMLR · CC BY 4.0
Abstract: Gaussian splatting methods are emerging as a popular approach for converting multi-view image data into scene representations that allow view synthesis. In particular, there is interest in enabling view synthesis for dynamic scenes using only monocular input data---an ill-posed and challenging problem. The fast pace of work in this area has produced multiple simultaneous papers that claim to work best, which cannot all be true. In this work, we organize, benchmark, and analyze many Gaussian-splatting-based methods, providing apples-to-apples comparisons that prior works have lacked. We use multiple existing datasets and a new instructive synthetic dataset designed to isolate factors that affect reconstruction quality. We systematically categorize Gaussian splatting methods into specific motion representation types and quantify how their differences impact performance. Empirically, we find that their rank order is well-defined in synthetic data, but the complexity of real-world data currently overwhelms the differences. Furthermore, the fast rendering speed of all Gaussian-based methods comes at the cost of brittleness in optimization. We summarize our experiments into a list of findings that can help to further progress in this lively problem setting.
Submission Length: Long submission (more than 12 pages of main content)
Assigned Action Editor: ~Jia-Bin_Huang1
Submission Number: 3932
