Vehicular Multimodal Motion Forecasting via Conditional Score-based Modeling

Published: 01 Jan 2023, Last Modified: 16 May 2025VTC Fall 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Accurately forecasting the future motions of road participants is essential for proactive hazard avoidance and safety planning of autonomous vehicles. Existing methods for motion prediction based on probabilistic generative models are limited to low-accuracy likelihood calculations and relatively finite mode distributions. Recent studies show that score-based models can naturally overcome these limitations. In this work, we present a novel paradigm of conditional score-based models for vehicle motion prediction, called Motion-CSM. First, we model scene contextual representations of interaction regions at the feature level via graph convolutional networks. We then interpolate these representations as conditions into the solution process of the continuous-time reverse stochastic differential equation (SDE) to guide trajectory generation, which progressively converts the known prior distributions into multimodal trajectories including the ground truth modes. The designed stacked Transformer structure with dual control conditions is adopted to learn the score function approximation of the Gaussian perturbation kernel. Finally, we develop multiple consistency constraints to align the inference results of Motion-CSM in reverse SDE solving to improve the self-consistency and stability of multimodal trajectory generation. Experimental results on the real-world motion dataset demonstrate that the multimodal forecasting accuracy of Motion-CSM outperforms state-of-the-art methods.
Loading