Spatial Understanding from Videos: Structured Prompts Meet Simulation Data.

Haoyu Zhang, Meng Liu 0006, Zaijing Li, Haokun Wen, Weili Guan, Yaowei Wang 0001, Liqiang Nie

14 Jan 2026 (modified: 16 Jan 2026)CoRR 2025EveryoneRevisionsCC BY-SA 4.0
Loading