A novel multi-step Q-learning method to improve data efficiency for deep reinforcement learning

Published: 01 Jan 2019, Last Modified: 13 May 2025Knowl. Based Syst. 2019EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•A novel multi-step Q-learning method is proposed to improve data efficiency for DRL.•The proposed multi-step Q-learning method is derived by adopting a new return function.•The new return function alters the discount of future rewards and loosens the impact of the immediate reward.•Experimental-results shows the proposed methods can improve the data efficiency of DRL agents.
Loading