Mixed-View Panorama Synthesis using Geospatially Guided Diffusion

TMLR Paper4142 Authors

04 Feb 2025 (modified: 07 Apr 2025)Under review for TMLREveryoneRevisionsBibTeXCC BY 4.0
Abstract: We introduce the task of mixed-view panorama synthesis, where the goal is to synthesize a novel panorama given a small set of input panoramas and a satellite image of the area. This contrasts with previous work which only uses input panoramas (same-view synthesis), or an input satellite image (cross-view synthesis). We argue that the mixed-view setting is the most natural to support panorama synthesis for arbitrary locations worldwide. A critical challenge is that the spatial coverage of panoramas is uneven, with few panoramas available in many regions of the world. We introduce an approach that utilizes diffusion-based modeling and an attention-based architecture for extracting information from all available input imagery. Experimental results demonstrate the effectiveness of our proposed method. In particular, our model can handle scenarios when the available panoramas are sparse or far from the location of the panorama we are attempting to synthesize. Our code and model checkpoints will be made publicly available upon publication.
Submission Length: Regular submission (no more than 12 pages of main content)
Assigned Action Editor: ~Marcus_A_Brubaker1
Submission Number: 4142
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview