RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark

Federico Berto; Chuanbo Hua; Junyoung Park; Laurin Luttmann; Yining Ma; Fanchen Bu; Jiarui Wang; Haoran Ye; Minsu Kim; Sanghyeok Choi; Nayeli Gast Zepeda; André Hottung; Jianan Zhou; Jieyi Bi; Yu Hu; Fei Liu; Hyeonah Kim; Jiwoo Son; Haeyeon Kim; Davide Angioni; Wouter Kool; Zhiguang Cao; Qingfu Zhang; Joungho Kim; Jie Zhang; Kijung Shin; Cathy Wu; Sungsoo Ahn; Guojie Song; Changhyun Kwon; Kevin Tierney; Lin Xie; Jinkyoo Park

28 Sept 2024 (modified: 09 Nov 2024), ICLR 2025 Conference Desk Rejected Submission, CC BY 4.0

Keywords: Reinforcement Learning, Combinatorial Optimization, PyTorch, Benchmark

TL;DR: We present RL4CO, a novel, user-friendly and extensive Reinforcement Learning for Combinatorial Optimization benchmark

Abstract: Deep reinforcement learning (RL) has recently shown significant benefits in solving combinatorial optimization (CO) problems, reducing reliance on domain expertise, and improving computational efficiency. However, the field lacks a unified benchmark for easy development and standardized comparison of algorithms across diverse CO problems. To fill this gap, we introduce RL4CO, a unified and extensive benchmark with in-depth library coverage of 23 state-of-the-art methods and 20+ CO problems. Built on efficient software libraries and best practices in implementation, RL4CO features modularized implementation and flexible configuration of diverse RL algorithms, neural network architectures, inference techniques, and environments. RL4CO allows researchers to seamlessly navigate existing successes and develop their unique designs, facilitating the entire research process by decoupling science from heavy engineering. We also provide extensive benchmark studies to inspire new insights and future work. RL4CO has attracted numerous researchers in the community and is open-sourced.

Primary Area: datasets and benchmarks

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.

Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2025/AuthorGuide.

Reciprocal Reviewing: I understand the reciprocal reviewing requirement as described on https://iclr.cc/Conferences/2025/CallForPapers. If none of the authors are registered as a reviewer, it may result in a desk rejection at the discretion of the program chairs. To request an exception, please complete this form at https://forms.gle/Huojr6VjkFxiQsUp6.

Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors’ identity.

No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.

Submission Number: 13736
