Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning

Ruiyang Xu, Jalaj Bhandari, Dmytro Korenkevych, Fan Liu, Yuchen He, Alex Nikulkov, Zheqing Zhu

Published: 2023, Last Modified: 06 Mar 2026RecSys 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading