Abstract: In deep reinforcement learning (RL) systems, abnormal states pose significant risks by potentially triggering unpredictable behaviors and unsafe actions, thus impeding the deployment of RL systems in real-world scenarios. It is crucial for reliable decision-making systems to have the capability to raise an alert whenever they encounter unfamiliar observations that they are not equipped to handle. In this paper, we propose a novel Mahalanobis distance-based (MD) anomaly detection framework, called \textit{MDX}, for deep RL algorithms. MDX simultaneously addresses random, adversarial, and out-of-distribution (OOD) state outliers in both offline and online settings. It utilizes Mahalanobis distance within class-conditional distributions for each action and operates within a statistical hypothesis testing framework under the Gaussian assumption. We further extend it to robust and distribution-free versions by incorporating Robust MD and conformal inference techniques. Through extensive experiments on classical control environments, Atari games, and autonomous driving scenarios, we demonstrate the effectiveness of our MD-based detection framework. MDX offers a simple, unified, and practical tool for enhancing the safety and reliability of RL systems in real-world applications.
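For concreteness, the core scoring step described in the abstract can be sketched as follows under the stated Gaussian assumption. This is a minimal illustration, not the paper's implementation: every name here (`fit_gaussians`, `md_score`, `is_anomalous`, the feature extractor, and the chi-square threshold `alpha`) is an assumption introduced for this sketch.

```python
# Minimal sketch: action-conditional Mahalanobis-distance anomaly scoring.
# Assumes a feature map phi(state) -> R^d (e.g., an RL network's penultimate
# layer); all names and the chi-square threshold are illustrative assumptions.
import numpy as np
from scipy.stats import chi2

def fit_gaussians(features, actions, eps=1e-6):
    """Fit one Gaussian N(mu_a, Sigma_a) per action from in-distribution data."""
    params = {}
    for a in np.unique(actions):
        x = features[actions == a]                                 # (n_a, d)
        mu = x.mean(axis=0)
        cov = np.cov(x, rowvar=False) + eps * np.eye(x.shape[1])   # regularize
        params[a] = (mu, np.linalg.inv(cov))
    return params

def md_score(phi_s, action, params):
    """Squared Mahalanobis distance of phi(s) to its action-conditional Gaussian."""
    mu, prec = params[action]
    diff = phi_s - mu
    return float(diff @ prec @ diff)

def is_anomalous(phi_s, action, params, d, alpha=0.01):
    """Hypothesis test: under N(mu_a, Sigma_a), the squared MD is chi2(d)-
    distributed; flag an anomaly when the score exceeds the (1 - alpha) quantile."""
    return md_score(phi_s, action, params) > chi2.ppf(1.0 - alpha, df=d)

# Usage on synthetic data: 2 actions, 8-dimensional features.
rng = np.random.default_rng(0)
feats = rng.normal(size=(500, 8))
acts = rng.integers(0, 2, size=500)
params = fit_gaussians(feats, acts)
print(is_anomalous(rng.normal(size=8), 0, params, d=8))       # usually False
print(is_anomalous(10 + rng.normal(size=8), 0, params, d=8))  # True (shifted OOD)
```

The chi-square threshold reflects the fact that the squared Mahalanobis distance of a d-dimensional Gaussian sample follows a chi-square distribution with d degrees of freedom; the robust and conformal variants mentioned in the abstract would presumably replace the mean/covariance estimates and the threshold, respectively.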
Submission Length: Long submission (more than 12 pages of main content)
Previous TMLR Submission Url: https://openreview.net/forum?id=4UblldgC9p
Changes Since Last Submission: We revisited the manuscript and ensured that the font now conforms to the template default.
Assigned Action Editor: ~Zhiyu_Zhang1
Submission Number: 3068