Autonomous driving policy learning from demonstration using regression loss function

Yukun Xiao, Yisheng An, Ting Li, Naiqi Wu, Wei He, Peng Li

Published: 01 Jan 2024, Last Modified: 11 Apr 2025. Knowl. Based Syst. 2024. CC BY-SA 4.0

Abstract: Highlights
• A novel pre-training DRL algorithm simplifies the pre-training phase.
• The algorithm reduces the format requirements on the demonstration data.
• The algorithm simplifies the dominance term.
• A novel priority formula fulfills the algorithm’s needs for replaying experience.
• Double target networks achieve more reliable training.