Keywords: multi-agent systems, model deference, inter-agent dynamics, consensus formation, multi-agent deliberation, large language models
TL;DR: We study how consensus emerges in multi-agent LLM systems and show that agreement is often driven by directional model deference rather than fully independent reasoning.
Abstract: Multi-agent systems are increasingly being deployed in real-world applications, and understanding inter-agent dynamics is critical for developing reliable and robust systems. While multi-agent systems have been shown to improve accuracy, the underlying interaction dynamics that drive consensus more generally remain poorly understood. In this paper, we conduct an empirical study of multi-turn agent interactions, analyzing how consensus forms through disagreement and model deference across both objective and subjective datasets. Across experiments, we find that model deference is not a fixed hierarchical property in heterogeneous settings, but instead emerges only under specific conditions. In contrast, homogeneous settings do not exhibit a consistent hierarchical structure. Under answer rotation, even though smaller models do tend to defer to larger models the majority of the time, the rate at which larger models defer to smaller models increases. This shows that model identity may not be the sole explanatory factor for model deference. Additionally, multi-agent dynamics can be actively controlled via system prompts. Overall, disagreement and model deference provide informative signals for studying multi-agent behavior beyond accuracy to determine the reliability and robustness of multi-agent systems.
Email Sharing: We authorize the sharing of all author emails with Program Chairs.
Data Release: We authorize the release of our submission and author names to the public in the event of acceptance.
Submission Number: 63
Loading