Are Multimodal Large Language Models Pragmatically Competent Listeners in Simple Reference Resolution Tasks?

Published: 2025, Last Modified: 26 May 2026ACL (Findings) 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading