Robust Policy Gradient against Strong Data Corruption

Published: 2021, Last Modified: 22 Sept 2023, ICML 2021, Readers: Everyone
Abstract: We study the problem of robust reinforcement learning under adversarial corruption on both rewards and transitions. Our attack model assumes an \textit{adaptive} adversary who can arbitrarily corru...
