Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits

2020 (modified: 05 Nov 2022), ICML 2020, Readers: Everyone
Abstract: Policy learning using historical observational data is an important problem that has found widespread applications. However, existing literature rests on the crucial assumption that the future envi...