Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)

Bojun Huang

2022 (modified: 07 Aug 2022)ICML 2022Readers: Everyone

Abstract: This paper discusses a new approach to the fundamental problem of learning optimal Q-functions. In this approach, optimal Q-functions are formulated as saddle points of a nonlinear Lagrangian funct...

0 Replies