Unsupervised Discovery and Composition of Object Light Fields

Cameron Omid Smith; Hong-Xing Yu; Sergey Zakharov; Fredo Durand; Joshua B. Tenenbaum; Jiajun Wu; Vincent Sitzmann

Unsupervised Discovery and Composition of Object Light Fields

Cameron Omid Smith, Hong-Xing Yu, Sergey Zakharov, Fredo Durand, Joshua B. Tenenbaum, Jiajun Wu, Vincent Sitzmann

Published: 20 Jun 2023, Last Modified: 17 Sept 2024Accepted by TMLREveryoneRevisionsBibTeXCC BY 4.0

Abstract: Neural scene representations, both continuous and discrete, have recently emerged as a powerful new paradigm for 3D scene understanding. Recent efforts have tackled unsupervised discovery of object-centric neural scene representations. However, the high cost of ray-marching, exacerbated by the fact that each object representation has to be ray-marched separately, leads to insufficiently sampled radiance fields and thus, noisy renderings, poor framerates, and high memory and time complexity during training and rendering. Here, we propose to represent objects in an object-centric, compositional scene representation as light fields. We propose a novel light field compositor module that enables reconstructing the global light field from a set of object-centric light fields. Dubbed Compositional Object Light Fields (COLF), our method enables unsupervised learning of object-centric neural scene representations, state-of-the-art reconstruction and novel view synthesis performance on standard datasets, and rendering and training speeds at orders of magnitude faster than existing 3D approaches.

Submission Length: Regular submission (no more than 12 pages of main content)

Previous TMLR Submission Url: https://openreview.net/forum?id=fDgdqEuKX0

Code: https://github.com/cameronosmith/COLF

Supplementary Material: zip

Assigned Action Editor: ~Antoni_B._Chan1

License: Creative Commons Attribution 4.0 International (CC BY 4.0)

Submission Number: 772

Loading