Mini Amusement Parks (MAPs): A Testbed for Modelling Business Decisions

Published: 16 Nov 2025, Last Modified: 16 Nov 2025 | OpenReview Archive Direct Upload | Everyone | CC BY 4.0
Abstract: Despite rapid progress in artificial intelligence, current systems struggle with the interconnected challenges that define real-world decision making. Practical domains, such as business management, require optimizing an open-ended and multi-faceted objective, actively learning environment dynamics from sparse experience, planning over long horizons in stochastic settings, and reasoning over spatial information. Yet existing human–AI benchmarks isolate subsets of these capabilities, limiting our ability to assess holistic decision-making competence. We introduce Mini Amusement Parks (MAPs), an amusement-park simulator designed to evaluate an agent’s ability to model its environment, anticipate long-term consequences under uncertainty, and strategically operate a complex business. We provide human baselines and a comprehensive evaluation of state-of-the-art LLM agents, finding that humans outperform these systems by 6.5× on easy mode and 9.8× on medium mode. Our analysis reveals persistent weaknesses in long-horizon optimization, sample-efficient learning, spatial reasoning, and world modelling. By unifying these challenges within a realistic, scalable environment, MAPs offers a new foundation for developing and benchmarking agents capable of adaptable decision making.