Multiplayer Nash Preference Optimization.

Fang Wu, Xu Huang, Weihao Xuan, Zhiwei Zhang, Yijia Xiao, Guancheng Wan, Xiaomin Li, Bing Hu, Peng Xia, Jure Leskovec, Yejin Choi 0001

12 Nov 2025CoRR 2025EveryoneCC BY-SA 4.0
Loading