Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Yu Gu; Kai Zhang; Yuting Ning; Boyuan Zheng; Boyu Gou; Tianci Xue; Cheng Chang; Sanjari Srivastava; Yanan Xie; Peng Qi; Huan Sun; Yu Su

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Yu Gu, Kai Zhang, Yuting Ning, Boyuan Zheng, Boyu Gou, Tianci Xue, Cheng Chang, Sanjari Srivastava, Yanan Xie, Peng Qi, Huan Sun, Yu Su

Published: 06 Nov 2025, Last Modified: 06 Nov 2025Accepted by TMLREveryoneRevisionsBibTeXCC BY 4.0

Abstract: Language agents based on large language models (LLMs) have demonstrated great promise in automating web-based tasks. Recent work has shown that incorporating advanced planning algorithms, e.g., tree search, is advantageous over reactive planning for web agents. However, unlike simulated sandbox environments, real-world environments such as the web are rife with irreversible actions. This undermines the feasibility of backtracking, a cornerstone of (tree) search. Overly relying on test-time search also hurts efficiency. We advocate model-based planning for web agents that employs a world model to simulate and deliberate over the outcome of each candidate action before committing to one. We systematically explore this paradigm by: (1) Proposing a model-based planning framework, WebDreamer, which employs LLMs to serve as both world models and value functions; (2) Training specialized LLMs as world models with a scalable data synthesis pipeline. Empirical results demonstrate that WebDreamers achieves substantial performance improvements over reactive baselines. It is competitive, while being - times more efficient, with tree search in sandbox environments (VisualWebArena) and also works effectively on real-world websites (Online-Mind2Web and Mind2Web-Live). Furthermore, our trained world model, Dreamer-7B, performs comparable to GPT-4o, highlighting the potential of specialized world models for efficient and effective planning in complex web environments. All code, models, and data are publicly available at https://github.com/OSU-NLP-Group/WebDreamer

Submission Length: Regular submission (no more than 12 pages of main content)

Code: https://github.com/OSU-NLP-Group/WebDreamer

Assigned Action Editor: ~Erin_J_Talvitie1

Submission Number: 5336

Loading