iARCS: Iterative Agentic RL for Controllable 3D Scene Generation

Saugat Adhikari; Ashok Prasad Neupane; Pramish Paudel; Ajad Chhatkuli; Danda Pani Paudel

iARCS: Iterative Agentic RL for Controllable 3D Scene Generation

Saugat Adhikari, Ashok Prasad Neupane, Pramish Paudel, Ajad Chhatkuli, Danda Pani Paudel

17 Mar 2026 (modified: 22 Jun 2026)CVPR 2026 Workshop SynData4CVEveryoneRevisionsCC BY 4.0

Keywords: 3D Indoor Scene Generation, Synthetic 3D Data, RL, Diffusion

Abstract: Synthetic 3D scene generation is increasingly used as a data source for computer vision and embodied AI, but existing generators often optimize perceptual realism without reliably satisfying task-critical functional constraints. This mismatch limits the usefulness of synthetic data for downstream training, where accessibility, traversability, and spatial rule compliance are often essential. We present iARCS, an iterative agentic reinforcement learning framework that adapts a pretrained scene generator to natural-language task requirements. iARCS uses a two-stage strategy: universal-reward pretraining to improve physical plausibility and layout quality, followed by task-specific fine-tuning with LLM-generated reward programs that are iteratively refined from training feedback. Experiments show improved constraint fidelity on walkability, reachability, and clearance-focused tasks, strong task-conditioned generalization, and competitive scene diversity. We further show that data generated by iARCS improves a base generator, supporting its value as a practical synthetic data generation tool rather than only a controllable scene editing method.

Supplementary Material: pdf

Submission Number: 55

Loading