PrLM: Learning Explicit Reasoning for Personalized RAG via Contrastive Reward Optimization.

Kepu Zhang, Teng Shi, Weijie Yu, Jun Xu

14 Jan 2026 | CIKM 2025 | Everyone | CC BY-SA 4.0