Overcoming intermittent instability in reinforcement learning via gradient norm preservation

Published: 01 Jan 2025, Last Modified: 14 May 2025Inf. Sci. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•The method stabilizes reinforcement learning by reducing intermittent gradient spikes.•Intermittent gradient spikes are controlled using adaptive learning rate adjustments.•The method preserves initial gradient norms, aiding stable value learning.•Experiments demonstrate improved stability and performance in reinforcement learning.
Loading