Offline Reinforcement Learning for Optimizing Production Bidding Policies

Dmytro Korenkevych, Frank Cheng, Artsiom Balakir, Alex Nikulkov, Lingnan Gao, Zhihao Cen, Zuobing Xu, Zheqing Zhu

Published: 2024, Last Modified: 06 Mar 2026KDD 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading