Abstract: The neural radiance field (NeRF) has made significant strides in representing 3D scenes and synthesizing novel views. Despite its advancements, the high computational costs of NeRF have posed challenges for its deployment in resource-constrained environments and real-time applications. As an alternative to NeRF-like neural rendering methods, 3D Gaussian Splatting (3DGS) offers rapid rendering speeds while maintaining excellent image quality. However, as it represents objects and scenes using a myriad of Gaussians, it requires substantial storage to achieve high-quality representation. To mitigate the storage overhead, we propose Factorized 3D Gaussian Splatting (F-3DGS), a novel approach that drastically reduces storage requirements while preserving image quality. Inspired by classical matrix and tensor factorization techniques, our method represents and approximates dense clusters of Gaussians with significantly fewer Gaussians through efficient factorization. We aim to efficiently represent dense 3D Gaussians by approximating them with a limited amount of information for each axis and their combinations. This method allows us to encode a substantially large number of Gaussians along with their essential attributes---such as color, scale, and rotation---necessary for rendering using a relatively small number of elements. Extensive experimental results demonstrate that F-3DGS achieves a significant reduction in storage costs while maintaining comparable quality in rendered images.
Primary Subject Area: [Content] Media Interpretation
Secondary Subject Area: [Content] Vision and Language
Relevance To Conference: This research delves into exploring various representations in 3D reconstruction models. By employing lighter models and enhancing rendering speed, identifying more suitable representations for objects or natural scenes can significantly enhance multimedia representation. This study showcases the remarkable efficiency of point-based 3D representation, marking a significant stride towards 3D Gaussian Splatting (3DGS) as a promising scene representation technique. The adoption of lighter and swifter models holds the promise of accelerated rendering speeds. Simultaneously, the utilization of lighter models opens avenues for achieving high-fidelity 3D reconstructions on compact devices such as smartphones or tablets, transcending the limitations of conventional computers. Furthermore, in the future, this advancement will pave the way for more compact and realistic Augmented Reality (AR) and Virtual Reality (VR) experiences.
Supplementary Material: zip
Submission Number: 2641
Loading