An Off-Policy Learning Approach for Steering Sentence Generation towards Personalization

Haruka Kiyohara, Daniel Yiming Cao, Yuta Saito, Thorsten Joachims

Published: 22 Sept 2025, Last Modified: 25 Jan 2026, License: CC BY-SA 4.0