ICLR 2025 Workshop LLM Reason and Plan Submissions
MAS-GPT: Training LLMs To Build LLM-Based Multi-Agent Systems
Rui Ye, Shuo Tang, Rui Ge, Yaxin Du, Zhenfei Yin, Jing Shao, Siheng Chen
Published: 05 Mar 2025, Last Modified: 05 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers (Abridged)
Shalev Lifshitz, Sheila A. McIlraith, Yilun Du
Published: 05 Mar 2025, Last Modified: 20 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Local Look-Ahead Guidance via Verifier-in-the-Loop for Automated Theorem Proving
Sara Rajaee, Kumar Pratik, Gabriele Cesa, Arash Behboodi
Published: 05 Mar 2025, Last Modified: 18 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Strategic LLM Decoding through Bayesian Games
Weitong Zhang, Chengqi Zang, Bernhard Kainz
Published: 05 Mar 2025, Last Modified: 11 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Revealing chemical reasoning in LLMs through search on complex planning tasks
Andres M Bran, Théo A. Neukomm, Daniel P Armstrong, Zlatko Jončev, Philippe Schwaller
Published: 05 Mar 2025, Last Modified: 05 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
MastermindEval: A Simple But Scalable Reasoning Benchmark
Jonas Golde, Patrick Haller, Fabio Barth, Alan Akbik
Published: 05 Mar 2025, Last Modified: 13 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws
Prasanna Mayilvahanan, Thaddäus Wiedemer, Sayak Mallick, Matthias Bethge, Wieland Brendel
Published: 17 Mar 2025, Last Modified: 17 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Advancing Multimodal In-Context Learning in Large Vision-Language Models with Task-aware Demonstrations
Yanshu Li
Published: 05 Mar 2025, Last Modified: 20 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Understanding Reasoning in Thinking Language Models via Steering Vectors
Constantin Venhoff, Iván Arcuschin, Philip Torr, Arthur Conmy, Neel Nanda
Published: 05 Mar 2025, Last Modified: 20 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Reasoning Effort and Problem Complexity: A Scaling Analysis in LLMs
Benjamin Estermann, Roger Wattenhofer
Published: 05 Mar 2025, Last Modified: 18 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
LookPlanGraph: Embodied instruction following method with VLM graph augmentation
Anatoly Onishchenko, Alexey Kovalev, Aleksandr Panov
Published: 05 Mar 2025, Last Modified: 20 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
PHYSICS: Benchmarking Foundation Models for Problem Solving in Physics
Kaiyue Feng, Yilun Zhao, Yixin Liu, Tianyu Yang, Chen Zhao, John Sous, Arman Cohan
Published: 05 Mar 2025, Last Modified: 19 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Rethinking Fine-tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning
Feng Chen, Allan Raventos, Nan Cheng, Surya Ganguli, Shaul Druckmann
Published: 05 Mar 2025, Last Modified: 19 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
PC-Agent: A Hierarchical Agentic Framework for Complex Task Automation on PC
Haowei Liu, Xi Zhang, Haiyang Xu, Yuyang Wanyan, Junyang Wang, Ming Yan, Ji Zhang, Chunfeng Yuan, Changsheng Xu, Weiming Hu, Fei Huang
Published: 05 Mar 2025, Last Modified: 20 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Flex-TravelPlanner: A Benchmark for Flexible Planning with Language Agents
Juhyun Oh, Eunsu Kim, Alice Oh
Published: 05 Mar 2025, Last Modified: 19 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Improving Test-Time Search for LLMs with Backtracking Against In-Context Value Verifiers
Anikait Singh, Kushal Arora, Sedrick Keh, Jean Mercat, Tatsunori Hashimoto, Chelsea Finn, Aviral Kumar
Published: 05 Mar 2025, Last Modified: 19 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Resolving Ambiguity through Personalization in LLM chat systems
Sophia Huiwen Sun, Abishek Sankararaman, Balakrishnan Murali Narayanaswamy
Published: 05 Mar 2025, Last Modified: 19 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Cutting Through the Noise: Boosting LLM Performance on Math Word Problems
Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral, Swaroop Mishra
Published: 05 Mar 2025, Last Modified: 16 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Training Large Language Models to Reason in a Continuous Latent Space
Shibo Hao, Sainbayar Sukhbaatar, DiJia Su, Xian Li, Zhiting Hu, Jason E Weston, Yuandong Tian
Published: 05 Mar 2025, Last Modified: 20 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
s1: Simple test-time scaling
Niklas Muennighoff, Zitong Yang, Weijia Shi, Xiang Lisa Li, Li Fei-Fei, Hannaneh Hajishirzi, Luke Zettlemoyer, Percy Liang, Emmanuel Candes, Tatsunori Hashimoto
Published: 05 Mar 2025, Last Modified: 15 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Multi-Turn Code Generation Through Single-Step Rewards
Arnav Kumar Jain, Gonzalo Gonzalez-Pumariega, Wayne Chen, Alexander M Rush, Wenting Zhao, Sanjiban Choudhury
Published: 05 Mar 2025, Last Modified: 20 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Teaching Transformers Causal Reasoning through Axiomatic Training
Aniket Vashishtha, Abhinav Kumar, Atharva Pandey, Abbavaram Gowtham Reddy, Kabir Ahuja, Vineeth N. Balasubramanian, Amit Sharma
Published: 05 Mar 2025, Last Modified: 19 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Offline Reinforcement Learning for LLM Multi-Step Reasoning
Huaijie Wang, Shibo Hao, Hanze Dong, Shenao Zhang, Yilin Bao, Ziran Yang, Yi Wu
Published: 05 Mar 2025, Last Modified: 22 Apr 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
Value-Based Deep RL Scales Predictably
Oleh Rybkin, Michal Nauman, Preston Fu, Charlie Victor Snell, Pieter Abbeel, Sergey Levine, Aviral Kumar
Published: 05 Mar 2025, Last Modified: 19 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone
InductionBench: LLMs Fail in the Simplest Complexity Class
Wenyue Hua, Fei Sun, Liangming Pan, Adam Jardine, William Yang Wang
Published: 05 Mar 2025, Last Modified: 20 Mar 2025
Reasoning and Planning for LLMs @ ICLR2025
Readers: Everyone