2019 (modified: 11 Nov 2022), ICML 2019, Readers: Everyone
Abstract: Despite the remarkable success of Deep RL in learning control policies from raw pixels, the resulting models do not generalize. We demonstrate that a trained agent fails completely when facing smal...