Bridging the reality gap: A benchmark for physical reasoning in general world models with various physical phenomena beyond mechanics

Published: 01 Jan 2025, Last Modified: 18 Jul 2025Expert Syst. Appl. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•A physical reasoning benchmark covering four physical phenomena.•The first physical reasoning benchmark constructed from real-world videos.•Experiments show general world models have limited physical reasoning capabilities.•Analyze error cases and propose improvement strategies.
Loading