SymLight: Exploring Interpretable and Deployable Symbolic Policies for Traffic Signal Control

Xiao-Cheng Liao; Yi Mei; Mengjie Zhang

SymLight: Exploring Interpretable and Deployable Symbolic Policies for Traffic Signal Control

Xiao-Cheng Liao, Yi Mei, Mengjie Zhang

20 Sept 2025 (modified: 11 Feb 2026)Submitted to ICLR 2026EveryoneRevisionsBibTeXCC BY 4.0

Keywords: Reinforcement learning, traffic signal control, Monte Carlo tree search

TL;DR: Discovering interpretable and deployable symbolic policies for traffic signal control

Abstract: Deep Reinforcement Learning have achieved significant success in automatically devising effective traffic signal control (TSC) policies. Neural policies, however, tend to be over-parameterized and non-transparent, hindering their interpretability and deployability on resource-limited edge devices. This work presents SymLight, a priority function search framework based on Monte Carlo Tree Search (MCTS) for discovering inherently interpretable and deployable symbolic priority functions to serve as the TSC policies. The priority function, in particular, accepts traffic features as input and then outputs a priority for each traffic signal phase, which subsequently directs the phase transition. For effective search, we propose a concise yet expressive priority function representation. This helps mitigate the combinatorial explosion of the action space in MCTS. Additionally, a probabilistic structural rollout strategy is introduced to leverage structural patterns from previously discovered high-quality priority functions, guiding the rollout process. Our experiments on real-world datasets demonstrate SymLight's superior performance across a range of baselines. A key advantage is SymLight's ability to produce interpretable and deployable TSC policies while maintaining excellent performance. Our codes will be released upon acceptance.

Supplementary Material: zip

Primary Area: applications to robotics, autonomy, planning

Submission Number: 23530

Loading