Keywords: Multi AI-agent system, Constrained Multi-Objective-Optimisation, Pareto Optimality, AI agent Sovereignty
TL;DR: A proposal to leverage constrained multi-objective optimisation to formalise the evaluation of collaboration and crucially agent sovereignty in multi agent systems.
Abstract: Large Language Models (LLMs) are increasingly deployed as efficiency tools within organisations. The next step is their deployment as autonomous agents acting on behalf of these organisations and operating in complex socio-technical systems. In such settings, it will be necessary for agents to be sovereign, i.e. able to make decisions based on private incentives, shared objectives and operational constraints, requiring multi-objective evaluation. However, existing benchmarks and evaluation frameworks for multi-LLM agents rely predominantly on scalar metrics that do not capture this complex objective landscape. Here, we argue that Constrained Multi-Objective Optimisation provides a deployment-relevant evaluation framework for multi-LLM agent systems. We formalise two realistic types of trade-offs and present the results of experiments illustrating how Pareto dominance and hypervolume indicator reveal behavioural properties hidden by scalar metrics on different scenarios run in the CAMEL environment.
Submission Number: 91
Loading