Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regimeDownload PDF

28 Sept 2020, 15:50 (edited 18 Mar 2021, 10:53)ICLR 2021 PosterReaders: Everyone
Keywords:
Abstract:
One-sentence Summary:
Code Of Ethics:
10 Replies

Loading