Published: 01 Jan 2025, Last Modified: 12 Apr 2025Expert Syst. Appl. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract:Highlights•Proposal of RAV for robust action reward evaluation in RL frameworks.•Addressing sparse rewards: Velocity-domain advantage estimation.•Outperforms distance-based RL in diverse confrontational PDC scenarios.