Published: 01 Jan 2021, Last Modified: 12 May 2023ICML 2021Readers: Everyone
Abstract:We study reinforcement learning (RL) in episodic tabular MDPs with adversarial corruptions, where some episodes can be adversarially corrupted. When the total number of corrupted episodes is known,...