Arena: A Scalable and Configurable Benchmark for Policy LearningDownload PDF

08 Jun 2021 (modified: 24 May 2023)Submitted to NeurIPS 2021 Datasets and Benchmarks Track (Round 1)Readers: Everyone
Keywords: GNN-based policy, object-based state representation, scalable policy, configurable benchmark
TL;DR: We introduce the Arena benchmark, a scalable and configurable benchmark for policy learning.
Abstract: We believe current benchmarks for policy learning lack two important properties: scalability and configurability. The growing literature on modeling policies as graph neural networks calls for an object-based benchmark where the number of objects can be arbitrarily scaled and the mechanics can be freely configured. We introduce the Arena benchmark, a scalable and configurable benchmark for policy learning. Arena provides an object-based game-like environment where the number of objects can be arbitrarily scaled and the mechanics can be configured with a large degree of freedom. In this way, arena is designed to be an all-in-one environment that uses scaling and configuration to smoothly interpolates multiple dimensions of decision making that require different degrees of inductive bias.
Supplementary Material: zip
URL: https://github.com/Sirui-Xu/Arena
10 Replies

Loading