RLP: Reinforcement as a Pretraining Objective.

Ali Hatamizadeh, Syeda Nahida Akter, Shrimai Prabhumoye, Jan Kautz, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Yejin Choi

12 Nov 2025 | CoRR 2025 | CC BY-SA 4.0