Language-Guided Traffic Simulation via Scene-Level DiffusionDownload PDF

Published: 30 Aug 2023, Last Modified: 09 Oct 2023CoRL 2023 OralReaders: Everyone
Keywords: Traffic Simulation, Multi-Agent Diffusion, Large Language Model
TL;DR: A scene-level conditional diffusion model with a LLM based language interface for realistic and controllable traffic simulation.
Abstract: Realistic and controllable traffic simulation is a core capability that is necessary to accelerate autonomous vehicle (AV) development. However, current approaches for controlling learning-based traffic models require significant domain expertise and are difficult for practitioners to use. To remedy this, we present CTG++, a scene-level conditional diffusion model that can be guided by language instructions. Developing this requires tackling two challenges: the need for a realistic and controllable traffic model backbone, and an effective method to interface with a traffic model using language. To address these challenges, we first propose a scene-level diffusion model equipped with a spatio-temporal transformer backbone, which generates realistic and controllable traffic. We then harness a large language model (LLM) to convert a user's query into a loss function, guiding the diffusion model towards query-compliant generation. Through comprehensive evaluation, we demonstrate the effectiveness of our proposed method in generating realistic, query-compliant traffic simulations.
Student First Author: yes
Supplementary Material: zip
Instructions: I have read the instructions for authors (https://corl2023.org/instructions-for-authors/)
Publication Agreement: pdf
10 Replies

Loading