Spherical-Nested Diffusion Model for Panoramic Image Outpainting

Published: 01 May 2025, Last Modified: 18 Jun 2025ICML 2025 posterEveryoneRevisionsBibTeXCC BY 4.0
Abstract: Panoramic image outpainting acts as a pivotal role in immersive content generation, allowing for seamless restoration and completion of panoramic content. Given the fact that the majority of generative outpainting solutions operates on planar images, existing methods for panoramic images address the sphere nature by soft regularisation during the end-to-end learning, which still fails to fully exploit the spherical content. In this paper, we set out the first attempt to impose the sphere nature in the design of diffusion model, such that the panoramic format is intrinsically ensured during the learning procedure, named as spherical-nested diffusion (SpND) model. This is achieved by employing spherical noise in the diffusion process to address the structural prior, together with a newly proposed spherical deformable convolution (SDC) module to intrinsically learn the panoramic knowledge. Upon this, the proposed method is effectively integrated into a pre-trained diffusion model, outperforming existing state-of-the-art methods for panoramic image outpainting. In particular, our SpND method reduces the FID values by more than 50\% against the state-of-the-art PanoDiffusion method. Codes are publicly available at \url{https://github.com/chronos123/SpND}.
Lay Summary: Acquiring panoramic images ($360^\circ$ images) plays an important role in various applications in virtual reality (VR) and augmented reality (AR). However, collecting panoramic images is infeasible and time consuming. We developed techniques that transform standard photos and existing scenes into seamless panoramic images. This approach could enhance VR and AR applications by creating immersive environments from everyday visuals.
Link To Code: https://github.com/chronos123/SpND
Primary Area: Deep Learning->Generative Models and Autoencoders
Keywords: Panoramic image outpainting, spherical convolution, ERP format
Submission Number: 1233
Loading