Published: 01 Jan 2022, Last Modified: 12 May 2023, ICML 2022, Readers: Everyone
Abstract: We propose an algorithm that uses linear function approximation (LFA) for stochastic shortest path (SSP). Under minimal assumptions, it obtains sublinear regret, is computationally efficient, and u...