Learning Road Scene-level Representations via Semantic Region Prediction

Zihao Xiao; Alan Yuille; Yi-Ting Chen

Published: 10 Sept 2022, Last Modified: 05 May 2023, CoRL 2022 Poster, Readers: Everyone

Keywords: Semantic Region Prediction, Egocentric Vision, Driver Intent, Risk Object Identification

TL;DR: We propose a novel task called Semantic Region Prediction to learn road scene-level representations for two vital tasks in automated driving systems.

Abstract: In this work, we tackle two vital tasks in automated driving systems: driver intent prediction and risk object identification from egocentric images. Specifically, we investigate the question: what would be good road scene-level representations for these two tasks? We contend that a scene-level representation must capture the higher-level semantics and geometry of the traffic scene around the ego-vehicle as it performs actions on the way to its destination. To this end, we introduce the representation of semantic regions, which are the areas that an ego-vehicle visits while taking an afforded action (e.g., a left turn at a 4-way intersection). We propose to learn scene-level representations via a novel semantic region prediction task and an automatic semantic region labeling algorithm. Extensive evaluations are conducted on the HDD and nuScenes datasets, and the learned representations lead to state-of-the-art performance for driver intent prediction and risk object identification.

Student First Author: yes

Supplementary Material: zip
