DeepPhy: Benchmarking Agentic VLMs on Physical Reasoning

Xinrun Xu, Pi Bu, Ye Wang, Börje F. Karlsson, Ziming Wang, Tengtao Song, Qi Zhu, Jun Song, Zhiming Ding, Bo Zheng

Published: 2026, Last Modified: 01 Apr 2026, AAAI 2026, CC BY-SA 4.0