NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

Published: 25 Mar 2026, Last Modified: 25 Mar 2026CVPREveryonearXiv.org perpetual, non-exclusive license
Abstract: In this paper, we propose NeoVerse, a versatile 4D world model that is capable of 4D reconstruction, novel-trajectory video generation, and rich downstream applications. We first identify a common limitation of scalability in current 4D world modeling methods, caused either by expensive and specialized multi-view 4D data or by cumbersome training pre-processing. In contrast, our NeoVerse is built upon a core philosophy that makes the full pipeline scalable to diverse in-the-wild monocular videos.
Loading