Published: 01 Jan 2021, Last Modified: 12 May 2023ALT 2021Readers: Everyone
Abstract:In this paper, we propose new problem-independent lower bounds on the sample complexity and regret in episodic MDPs, with a particular focus on the \emph{non-stationary case} in which the transitio...