Toggle navigation
OpenReview
.net
Login
×
Back to
COLM
COLM 2025 Workshop XLLM-Reason-Plan Submissions
Reasoning Riddles: How Explainability Reveals Cognitive Limits in Vision-Language Models
Prahitha Movva
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Before You 〈think/〉, Monitor: Implementing Flavell's Metacognitive Framework in LLMs
Nick Oh
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
How Post-Training Reshapes LLMs: A Mechanistic View on Knowledge, Truthfulness, Refusal, and Confidence
Hongzhe Du
,
Weikai Li
,
Min Cai
,
Karim Saraipour
,
Zimin Zhang
,
Yizhou Sun
,
Himabindu Lakkaraju
,
Shichang Zhang
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
From Indirect Object Identification to Syllogisms: Exploring Binary Mechanisms in Transformer Circuits
Karim Saraipour
,
Shichang Zhang
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones
Daking Rai
,
Samuel Miller
,
Kevin Moran
,
Ziyu Yao
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
ReCalibrate: RL for Uncertainty-Aware Reasoning in LLMs
Mehul Damani
,
Isha Puri
,
Stewart Slocum
,
Idan Shenfeld
,
Jacob Andreas
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Are General-Purpose LLMs Ready for Planning? A Large- Scale Evaluation in PDDL
Kaustubh Vyas
,
Damien Graux
,
Sebastien Montella
,
Pavlos Vougiouklis
,
Jeff Z. Pan
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Case-Based Reasoning Enhances the Predictive Power of LLMs in Drug-Drug Interaction
Guangyi Liu
,
Yongqi Zhang
,
Xunyuan Liu
,
Quanming Yao
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation
Ziling Cheng
,
Meng Cao
,
Leila Pishdad
,
Yanshuai Cao
,
Jackie CK Cheung
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Angular Steering: Behavior Control via Rotation in Activation Space
Hieu M. Vu
,
Tan Minh Nguyen
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
HYBRIDMIND: Meta Selection of Natural Language and Symbolic Language for Enhanced LLM Reasoning
Simeng Han
,
Tianyu Liu
,
Chuhan Li
,
Xuyuan Xiong
,
Arman Cohan
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Rethinking (Human) Preference Evaluation of LLM Rationales
Ziang Li
,
Manasi Ganti
,
Zixian Ma
,
Helena Vasconcelos
,
Qijia He
,
Ranjay Krishna
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Latent Chain-of-Thought? Decoding the Depth-Recurrent Transformer
Wenquan Lu
,
Yuechuan Yang
,
Kyle Lee
,
Yanshu Li
,
Enqi Liu
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Beyond Autocomplete: Designing CopilotLens Towards Transparent and Explainable AI Coding Agents
Runlong Ye
,
Zeling Zhang
,
Boushra Almazroua
,
Michael Liut
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Disambiguate First, Parse Later: Generating Interpretations for Ambiguity Resolution in Semantic Parsing
Irina Saparina
,
Mirella Lapata
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data
Jiaming Zhou
,
Abbas Ghaddar
,
Ge Zhang
,
Liheng Ma
,
Yaochen Hu
,
Soumyasundar Pal
,
Bin Wang
,
Jianye HAO
,
Mark Coates
,
Yingxue Zhang
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration
Quan Shi
,
Carlos E Jimenez
,
Shunyu Yao
,
Nick Haber
,
Diyi Yang
,
Karthik R Narasimhan
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
Attributing Response to Context: A Jensen–Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation
Ruizhe Li
,
Chen Chen
,
Yuchen Hu
,
Yanjun Gao
,
Xi Wang
,
Emine Yilmaz
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone
The Geometry of Self-Verification in a Task-Specific Reasoning Model
Andrew Lee
,
Lihao Sun
,
Chris Wendler
,
Fernanda Viégas
,
Martin Wattenberg
Published: 24 Jul 2025, Last Modified: 04 Oct 2025
XLLM-Reason-Plan
Readers:
Everyone