Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey

Yuchen Huang; Sijia Li; Minghao LIU; Wei Liu; Zhiyuan Fan; Yi R. Fung

Published: 28 Sept 2025, Last Modified: 23 Oct 2025

Venue: SEA @ NeurIPS 2025 Poster

License: CC BY 4.0

Keywords: LLM Agents, Scaling Environments, Task Generation, Feedback Provision, Generator–Verifier Asymmetry

TL;DR: We systematically review representative methods for environment scaling from an environment-centric perspective and organize them along the three stages of the Generation-Execution-Feedback (GEF) loop: task generation, task execution, and feedback.

Abstract: LLM-based agents can autonomously accomplish complex tasks across various domains. However, to further cultivate capabilities such as adaptive behavior and long-term decision-making, training on static datasets built from human-level knowledge is insufficient. These datasets are costly to construct and lack both dynamism and realism. A growing consensus is that agents should instead interact directly with environments and learn from experience through reinforcement learning. We formalize this iterative process as the Generation-Execution-Feedback (GEF) loop, where environments generate tasks to challenge agents, return observations in response to agents' actions during task execution, and provide evaluative feedback on rollouts for subsequent learning. Under this paradigm, environments function as indispensable producers of experiential data, highlighting the need to scale them toward greater complexity, realism, and interactivity. In this survey, we first systematically review representative methods for environment scaling from a pioneering environment-centric perspective and organize them along the stages of the GEF loop. We further analyze benchmarks, implementation frameworks, and applications, consolidating fragmented advances and outlining future research directions for agent intelligence.
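The GEF loop described in the abstract can be illustrated with a minimal Python sketch. Everything in it is a hypothetical toy rather than the survey's formalization: the names `ToyEnvironment`, `Task`, `toy_agent`, and `gef_loop` are invented for illustration, the task (counting up to a target) is arbitrary, and the agent's policy update from feedback is omitted. The only adaptation shown is the environment raising task difficulty after a success.

```python
from dataclasses import dataclass


@dataclass
class Task:
    target: int  # hypothetical task: reach this number by counting up


@dataclass
class ToyEnvironment:
    difficulty: int = 1
    state: int = 0

    def generate_task(self) -> Task:
        # Generation: pose a task scaled by the current difficulty.
        self.state = 0
        return Task(target=self.difficulty * 2)

    def step(self, action: int) -> int:
        # Execution: apply the agent's action and return an observation.
        self.state += action
        return self.state

    def feedback(self, task: Task, trajectory: list) -> float:
        # Feedback: score the whole rollout (1.0 only if the target was reached).
        final_obs = trajectory[-1][1] if trajectory else 0
        return 1.0 if final_obs == task.target else 0.0


def toy_agent(observation: int, task: Task) -> int:
    # Fixed policy (assumption): increment until the target is reached, then stop.
    return 1 if observation < task.target else 0


def gef_loop(env, agent, iterations=3, max_steps=10):
    rewards = []
    for _ in range(iterations):
        task = env.generate_task()              # Generation
        obs, trajectory = 0, []
        for _ in range(max_steps):              # Execution
            action = agent(obs, task)
            if action == 0:
                break
            obs = env.step(action)
            trajectory.append((action, obs))
        reward = env.feedback(task, trajectory)  # Feedback
        rewards.append(reward)
        # A real system would update the agent here (e.g. with RL).
        # This sketch only makes the next task harder after a success.
        if reward == 1.0:
            env.difficulty += 1
    return rewards
```

Calling `gef_loop(ToyEnvironment(), toy_agent)` runs three iterations with progressively harder tasks. Lowering `max_steps` makes later tasks fail, at which point the difficulty stops increasing.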

Archival Option: The authors of this submission do *not* want it to appear in the archival proceedings.

Submission Number: 120