2019 (modified: 11 Nov 2022) | ICML 2019 | Readers: Everyone
Abstract: We consider the problem of learning from sparse and underspecified rewards, where an agent receives a complex input, such as a natural language instruction, and needs to generate a complex response...