2022 (modified: 07 Nov 2022), ICML 2022, Readers: Everyone
Abstract: Off-policy reinforcement learning (RL) from pixel observations is notoriously unstable. As a result, many successful algorithms must combine different domain-specific practices and auxiliary losses...