Red Teaming the Rules: An Adversarial Approach to Legal Alignment

Rui-Jie Yew; Greg Demirchyan

Red Teaming the Rules: An Adversarial Approach to Legal Alignment

Rui-Jie Yew, Greg Demirchyan

Published: 02 Mar 2026, Last Modified: 17 Apr 2026AFAA 2026 PosterEveryoneRevisionsBibTeXCC BY 4.0

Track: Tiny/Short Papers Track (up to 3 pages)

Keywords: legal alignment, agents, red teaming

Abstract: A core steering mechanism in a piece of regulation may lie not in what is written within it --- the primal path it lays to follow its rules--- but in what is left out of it --- the dual path it lays to avoid them. In this early-stage work, we frame a process of legal alignment (the alignment of a piece of regulation's text with its goals) as a multi-agent zero-sum game. Our hope is to demonstrate the relevance of AI alignment methods to legal oversight and the value of an adversarial frame for policy design.

Submission Number: 46

Loading