AMLGENTEX: Mobilizing Data-Driven Research to Combat Money Laundering

ICLR 2026 Conference Submission 17138 Authors

19 Sept 2025 (modified: 08 Oct 2025), ICLR 2026 Conference Submission, CC BY 4.0
Keywords: Graph Neural Networks, Anti-Money Laundering, Privacy, Graph Data, Distribution Shift, Imbalance, Dataset, Synthetic Data, Heterophilic Graphs
TL;DR: AMLGentex is an open-source framework for generating realistic transaction data that enables rigorous evaluation of anti-money laundering methods under real-world conditions.
Abstract: Money laundering enables organized crime by moving illicit funds into the legitimate economy. Although trillions of dollars are laundered each year, detection rates remain low because launderers evade oversight, confirmed cases are rare, and institutions see only fragments of the global transaction network. Since access to real transaction data is tightly restricted, synthetic datasets are essential for developing and evaluating detection methods. However, existing datasets fall short: they often neglect partial observability, temporal dynamics, strategic behavior, uncertain labels, class imbalance, and network-level dependencies. We introduce AMLGentex, an open-source suite for generating realistic, configurable transaction data and benchmarking detection methods. AMLGentex enables systematic evaluation of anti-money laundering systems under conditions that mirror real-world challenges. By releasing multiple country-specific datasets and practical parameter guidance, we aim to empower researchers and practitioners and provide a common foundation for collaboration and progress in combating money laundering.
Primary Area: datasets and benchmarks
Submission Number: 17138