Comparing human and machine communication patterns through a Tangram game

Published: 23 Sept 2025, Last Modified: 06 Dec 2025DBM 2025 Findings PosterEveryoneRevisionsBibTeXCC BY 4.0
TL;DR: Modern AI systems still fall short on human-like collaborative communication, failing to develop the shared linguistic conventions that humans naturally create through iterative dialogue.
Abstract: When humans communicate about visual objects, they develop shared linguistic conventions that progressively reduce referential ambiguity through collaborative dialogue. To better understand the representational patterns underlying human communication and test whether vision-capable large language models (VLLMs) exhibit similar communicative behaviors, we compare human-human and agent-agent interactions in the tangram communication game. In this task, two players establish shared references for abstract shapes through dialogue across six repeated rounds. We analyzed existing human-human data and conducted agent-agent experiments with five VLLMs, measuring performance and using representational probes to explore the potential structure underlying performance. Humans demonstrate clear convention formation, with representations becoming increasingly distinguishable across rounds as task accuracy improves from 78\% to 96\%. In contrast, AI agents fail to exhibit similar collaborative patterns in our experiments, achieving consistently low performance (10-30\%) with minimal improvement and no evidence of convention development, despite access to interim accuracy reports, full conversation history, and (most curiously) what appear to be largely accurate initial descriptions by 'director' agents. Taken together, these preliminary results suggest that current VLLMs---without task-specific training---may still struggle with kinds of grounded, evolving, coreferential communication of that defines human language in collaborative contexts.
Length: short paper (up to 4 pages)
Domain: data
Author List Check: The author list is correctly ordered and I understand that additions and removals will not be allowed after the abstract submission deadline.
Anonymization Check: This submission has been anonymized for double-blind review via the removal of identifying information such as names, affiliations, and URLs that point to identifying information.
Submission Number: 57
Loading