Toggle navigation
OpenReview
.net
Login
×
Back to
ICML
ICML 2025 Workshop R2-FM Submissions
From Tasks to Teams: A Risk-First Evaluation Framework for Multi-Agent LLM Systems in Finance
Zichen Chen
,
Jianda Chen
,
Jiaao Chen
,
Misha Sra
Published: 01 Jul 2025, Last Modified: 11 Jul 2025
ICML 2025 R2-FM Workshop Oral
Readers:
Everyone
Position: Agent-Specific Trustworthiness Risk as a Research Priority
Zeming Wei
,
Tianlin Li
,
Xiaojun Jia
,
Yihao Zhang
,
Yang Liu
,
Meng Sun
Published: 01 Jul 2025, Last Modified: 07 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Think with Moderation: Reasoning Models and Confidence Calibration in the Climate Domain
Romain Lacombe
,
Kerrie Wu
,
Eddie Dilworth
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Simple Mechanistic Explanations for Out-Of-Context Reasoning
Zifan Wang
,
Joshua Engels
,
Oliver Clive-Griffin
Published: 01 Jul 2025, Last Modified: 11 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Verbalized Confidence Triggers Self-Verification : Emergent Behavior Without Explicit Reasoning Supervision
Chaeyun Jang
,
Moonseok Choi
,
Yegon Kim
,
Hyungi Lee
,
Juho Lee
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval
Taiye Chen
,
Zeming Wei
,
Ang Li
,
Yisen Wang
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Investigating Tool-Memory Conflicts in Tool-Augmented LLMs
Jiali Cheng
,
Rui Pan
,
Hadi Amiri
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
MARVEL: Modular Abstention for Reliable and Versatile Expert LLMs
Bingbing Wen
,
Faeze Brahman
,
Zhan Su
,
Shangbin Feng
,
Yulia Tsvetkov
,
Lucy Lu Wang
,
Bill Howe
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations
Mohammad Aflah Khan
,
Mahsa Amani
,
Soumi Das
,
Bishwamittra Ghosh
,
Qinyuan Wu
,
Krishna P. Gummadi
,
Manish Gupta
,
Abhilasha Ravichander
Published: 01 Jul 2025, Last Modified: 11 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
DINGO: Constrained Inference for Diffusion LLMs
Tarun Suresh
,
Debangshu Banerjee
,
Shubham Ugare
,
Sasa Misailovic
,
Gagandeep Singh
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
SimBA: Simplifying Benchmark Analysis
Nishant Subramani
,
Alfredo Gomez
,
Mona T. Diab
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Prune 'n Predict: Optimizing LLM Decision-making with Conformal Prediction
Harit Vishwakarma
,
Alan Mishler
,
Thomas Cook
,
Niccolo Dalmasso
,
Natraj Raman
,
Sumitra Ganesh
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Reliable Statistical Inference with Synthetic Data from Large Language Models
Yewon Byun
,
Shantanu Gupta
,
Zachary Chase Lipton
,
Rachel Leah Childers
,
Bryan Wilder
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
On Characterizations for Language Generation: Interplay of Hallucinations, Breadth, and Stability
Alkis Kalavasis
,
Anay Mehrotra
,
Grigoris Velegkas
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
ASNO: An Interpretable Attention-Based Spatio-Temporal Neural Operator for Robust Scientific Machine Learning
Vispi Nevile Karkaria
,
Doksoo Lee
,
Yi-Ping Chen
,
Yue Yu
,
Wei Chen
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
(Im)possibility of Automated Hallucination Detection in Large Language Models
Amin Karbasi
,
Omar Montasser
,
John Sous
,
Grigoris Velegkas
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Extracting memorized pieces of (copyrighted) books from open-weight language models
A. Feder Cooper
,
Aaron Gokaslan
,
Ahmed M Ahmed
,
Amy B. Cyphert
,
Christopher De Sa
,
Mark Lemley
,
Daniel E. Ho
,
Percy Liang
Published: 01 Jul 2025, Last Modified: 04 Jul 2025
ICML 2025 R2-FM Workshop Oral
Readers:
Everyone
Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems
Parand A. Alamdari
,
Toryn Q. Klassen
,
Sheila A. McIlraith
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
TRoVe: Discovering Error-Inducing Static Feature Biases in Temporal Vision-Language Models
Maya Varma
,
Jean-Benoit Delbrouck
,
Sophie Ostmeier
,
Akshay S Chaudhari
,
Curtis Langlotz
Published: 01 Jul 2025, Last Modified: 06 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
A Thousand Words or An Image: Studying the Influence of Persona Modality in Multimodal LLMs
Julius Broomfield
,
Kartik Sharma
,
Srijan Kumar
Published: 01 Jul 2025, Last Modified: 09 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Aligned Textual Scoring Rule
Yuxuan Lu
,
Yifan Wu
,
Jason Hartline
,
Michael Curry
Published: 01 Jul 2025, Last Modified: 06 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Transformers Don't In-Context Learn Least Squares Regression
Joshua Hill
,
Benjamin Eyre
,
Elliot Creager
Published: 01 Jul 2025, Last Modified: 10 Jul 2025
ICML 2025 R2-FM Workshop Oral
Readers:
Everyone
A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1
Zhaoyi Li
,
Xiaohan Zhao
,
Dong-Dong Wu
,
Jiacheng Cui
,
Zhiqiang Shen
Published: 01 Jul 2025, Last Modified: 08 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
Escaping the SpuriVerse: Can Large Vision-Language Models Generalize Beyond Seen Spurious Correlations?
Yiwei Yang
,
Chung Peng Lee
,
Shangbin Feng
,
Dora Zhao
,
Bingbing Wen
,
Anthony Zhe Liu
,
Yulia Tsvetkov
,
Bill Howe
Published: 01 Jul 2025, Last Modified: 01 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
GenAI Copyright Evidence with Operational Meaning
Eli Chien
,
Amit Saha
,
Yinan Huang
,
Pan Li
Published: 01 Jul 2025, Last Modified: 07 Jul 2025
ICML 2025 R2-FM Workshop Poster
Readers:
Everyone
«
‹
1
2
3
4
5
›
»