PrLM: Learning Explicit Reasoning for Personalized RAG via Contrastive Reward Optimization

Kepu Zhang, Teng Shi, Weijie Yu, Jun Xu

Published: 10 Nov 2025, Last Modified: 15 Jan 2026, Crossref, CC BY-SA 4.0