OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
XingYu Li
Researcher, HiLab, Xiaohongshu
Joined
September 2024
Names
XingYu Li
(Preferred)
,
XingYu
Emails
****@gmail.com
(Confirmed)
,
****@xiaohongshu.com
(Confirmed)
Personal Links
LinkedIn
Career & Education History
Researcher
HiLab,
Xiaohongshu
(xiaohongshu.com)
2023
–
Present
Advisors, Relations & Conflicts
No relations added
Expertise
AI Alignment
2022
–
Present
Language Representation
2017
–
Present
Language Models
2017
–
Present
Deep Learning
2017
–
Present
Publications
Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Xueru Wen
,
Jie Lou
,
Zichao Li
,
Yaojie Lu
,
XingYu
,
Yuqiu Ji
,
Guohai Xu
,
Hongyu Lin
,
Ben He
,
Xianpei Han
,
Le Sun
,
Debing Zhang
ACL 2025 Main
Readers:
Everyone
Tackling Length Inflation Without Trade-offs: Group Relative Reward Rescaling for Reinforcement Learning
Zichao Li
,
Jie Lou
,
Fangchen Dong
,
Zhiyuan Fan
,
Mengjie Ren
,
Hongyu Lin
,
Xianpei Han
,
Debing Zhang
,
Le Sun
,
Yaojie Lu
,
XingYu Li
ICML 2026 regular
Readers:
Everyone
Scalable Oversight for Superhuman AI via Recursive Self-Critiquing
Xueru Wen
,
Jie Lou
,
Xinyu Lu
,
Junjie Yang
,
yanjiang liu
,
Yaojie Lu
,
Debing Zhang
,
XingYu
ICLR 2026 Conference Desk Rejected Submission
Readers:
Everyone
DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning
Ziwei Zheng
,
Michael Yang
,
Jack Hong
,
Chenxiao Zhao
,
Guohai Xu
,
Le Yang
,
Chao Shen
,
XingYu
ICLR 2026 Poster
Readers:
Everyone
Think When You Need: Self-Adaptive Chain-of-Thought Learning
Junjie Yang
,
Ke Lin
,
XingYu
Submitted to ICLR 2026
Readers:
Everyone
Towards Agentic Self-Learning LLMs in Search Environment
Wangtao Sun
,
Xiang Cheng
,
Jialin Fan
,
Yao Xu
,
XingYu
,
Shizhu He
,
Jun Zhao
,
Kang Liu
Submitted to ICLR 2026
Readers:
Everyone
Probabilistic Uncertain Reward Model
Wangtao Sun
,
Xiang Cheng
,
XingYu
,
Haotian Xu
,
Zhao Yang
,
Shizhu He
,
Jun Zhao
,
Kang Liu
Submitted to ICLR 2026
Readers:
Everyone
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?
Qingyu Yin
,
Chak Tou Leong
,
Linyi Yang
,
Wenxuan Huang
,
Wenjie Li
,
Xiting Wang
,
Jaehong Yoon
,
YunXing
,
XingYu
,
Jinjin Gu
ICLR 2026 Conference Withdrawn Submission
Readers:
Everyone
DeepEyesV2: Toward Agentic Multimodal Model
Jack Hong
,
Chenxiao Zhao
,
ChengLIn Zhu
,
Weiheng Lu
,
Guohai Xu
,
XingYu
ICLR 2026 Poster
Readers:
Everyone
Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
Xueru Wen
,
Jie Lou
,
Yaojie Lu
,
Hongyu Lin
,
XingYu
,
Xinyu Lu
,
Ben He
,
Xianpei Han
,
Debing Zhang
,
Le Sun
ICLR 2025 Spotlight
Readers:
Everyone
Co-Authors
Ben He
Chak Tou Leong
Chao Shen
ChengLIn Zhu
Chenxiao Zhao
Debing Zhang
Fangchen Dong
Guohai Xu
Haotian Xu
Hongyu Lin
Jack Hong
Jaehong Yoon
Jialin Fan
Jie Lou
Jinjin Gu
Jun Zhao
Junjie Yang
Kang Liu
Ke Lin
Le Sun
Le Yang
Linyi Yang
Mengjie Ren
Michael Yang
Qingyu Yin
View all 44 co-authors