Abstract: Neural radiance fields (NeRF) have recently appeared as a powerful tool for generating realistic views of objects and confined areas. Still, they face serious challenges with open scenes, where the camera has unrestricted movement, and content can appear at any distance. In such scenarios, current NeRF-inspired models frequently yield hazy or pixelated outputs, suffer slow training times, and might display irregularities because of the challenging task of reconstructing an extensive scene from a limited number of images. We propose a new framework to boost the performance of NeRF-based architectures yielding significantly superior outcomes compared to the prior work. Our solution overcomes several obstacles that affected earlier versions of NeRF, including handling multiple video inputs, selecting keyframes, and extracting poses from real-world frames that are ambiguous and symmetrical. Furthermore, we applied our framework, called “Pre-NeRF 360”, to enable the use of the Nutrition5k dataset in NeRF and introduce an updated version of this dataset, known as the N5k360 dataset. The source code, the dataset, and pre-trained weights for Pre-NeRF are publicly available at (https://amughrabi.github.io/prenerf).
External IDs:doi:10.1007/978-3-031-99565-1_7
Loading