OpenReview.net
Yiran Jenny Shen
PhD student, Computer Science and Engineering, University of California, San Diego
Joined August 2024
Names
Yiran Jenny Shen (Preferred), Yiran Shen
Emails
****@duke.edu (Confirmed), ****@ucsd.edu (Confirmed)
Personal Links
Homepage
LinkedIn
Career & Education History
PhD student, Computer Science and Engineering, University of California, San Diego (ucsd.edu), 2024 – Present
MS student, Duke University (duke.edu), 2022 – 2024
Advisors, Relations & Conflicts
PhD Advisor: Prithviraj Ammanabrolu, 2024 – Present
Expertise
Natural Language Processing, 2024 – Present
Reinforcement Learning, 2024 – Present
Publications
Pluralistic On-Policy Self-Distillation
Yiran Jenny Shen, Yu Xia, Liuyi Yao, Prithviraj Ammanabrolu
Pluralistic-Alignment 2026
Readers: Everyone
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
Yiran Jenny Shen, Yu Xia, Jonathan Daniel Chang, Prithviraj Ammanabrolu
MATH-AI 2025 Poster
Readers: Everyone
MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization
Rohan Surana, Junda Wu, Xintong Li, Sheldon Yu, Yiran Jenny Shen, Chuhan Wang, Tong Yu, Prithviraj Ammanabrolu, Jingbo Shang, Julian McAuley
Submitted to ICLR 2026
Readers: Everyone
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
Yiran Jenny Shen, Yu Xia, Jonathan Daniel Chang, Prithviraj Ammanabrolu
Submitted to ICLR 2026
Readers: Everyone
SAND: Boosting LLM Agents with Self-Taught Action Deliberation
Yu Xia, Yiran Jenny Shen, Junda Wu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Lina Yao, Julian McAuley
LAW
Readers: Everyone
SAND: Boosting LLM Agents with Self-Taught Action Deliberation
Yu Xia, Yiran Shen, Junda Wu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Lina Yao, Julian J. McAuley
CoRR 2025
Readers: Everyone
A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models
Zhouhang Xie, Junda Wu, Yiran Shen, Raghav Jain, Yu Xia, Xintong Li, Aaron Chang, Ryan A. Rossi, Tong Yu, Sachin Kumar, Bodhisattwa Prasad Majumder, Jingbo Shang, Prithviraj Ammanabrolu, Julian McAuley
COLM 2025
Readers: Everyone
In-context Ranking Preference Optimization
Junda Wu, Rohan Surana, Zhouhang Xie, Yiran Shen, Yu Xia, Tong Yu, Ryan A. Rossi, Prithviraj Ammanabrolu, Julian McAuley
COLM 2025
Readers: Everyone
Explainable Rewards in RLHF Using LLM-as-a-Judge
Yiran Shen, Aditya Emmanuel Arokiaraj John, Brandon Fain
ICLR 2025 Conference Withdrawn Submission
Readers: Everyone
Co-Authors
Aaron Chang
Aditya Emmanuel Arokiaraj John
Bodhisattwa Prasad Majumder
Brandon Fain
Chuhan Wang
Jingbo Shang
Jonathan Daniel Chang
Julian J. McAuley
Julian McAuley
Junda Wu
Lina Yao
Liuyi Yao
Prithviraj Ammanabrolu
Raghav Jain
Rohan Surana
Ryan A. Rossi
Sachin Kumar
Sheldon Yu
Sungchul Kim
Tong Yu
Xintong Li
Yu Xia
Zhouhang Xie