Your Model Diversity, Not Method, Determines Reasoning Strategy

Published: 01 Apr 2026, Last Modified: 25 Apr 2026 · ICLR 2026 Workshop LLM Reasoning · CC BY 4.0
Track: tiny / short paper (up to 4 pages)
Keywords: Reasoning; Diversity; MCTS
TL;DR: Before adopting any exploration strategy for LLM reasoning, characterize the target model's diversity profile; the optimal breadth-depth trade-off is a property of the model, not the method.
Abstract: Compute scaling for LLM reasoning requires allocating budget between exploring solution approaches (\emph{breadth}) and refining promising solutions (\emph{depth}). Most methods implicitly trade one for the other, yet why a given trade-off works remains unclear, and validation on a single model obscures the role of the model itself. We argue that \textbf{the optimal strategy depends on the model's \emph{diversity profile} (the spread of probability mass across solution approaches), and that this profile must be characterized before any exploration strategy is adopted.} We formalize this through a theoretical framework that decomposes reasoning uncertainty and derive conditions under which tree-style depth refinement outperforms parallel sampling. We validate the framework on the Qwen-3 4B and Olmo-3 7B families, showing that lightweight signals suffice for depth-based refinement on low-diversity aligned models while yielding limited utility for high-diversity base models, which we hypothesize require stronger compensation for lower exploration coverage.
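The abstract's central quantity, a model's diversity profile, can be illustrated with a minimal sketch. The code below is an illustrative assumption, not the paper's actual procedure: it estimates diversity as the normalized Shannon entropy of the empirical distribution over solution approaches, given per-sample approach labels (how samples are clustered into approaches is left abstract here), and applies a toy threshold rule for the breadth-vs-depth choice.

```python
import math
from collections import Counter

def diversity_profile(approach_labels):
    """Estimate a diversity profile as the normalized Shannon entropy
    of the empirical distribution over solution approaches.

    approach_labels: one label per sampled solution (obtained by some
    clustering of samples into approaches; a hypothetical preprocessing
    step, not specified by the paper).
    Returns a value in [0, 1]: near 0 for a low-diversity aligned model,
    near 1 for a high-diversity base model.
    """
    counts = Counter(approach_labels)
    n = sum(counts.values())
    if len(counts) <= 1:
        return 0.0  # all mass on one approach: no diversity
    entropy = -sum((c / n) * math.log(c / n) for c in counts.values())
    return entropy / math.log(len(counts))  # normalize by max entropy

def prefer_depth(approach_labels, threshold=0.5):
    """Toy allocation rule (threshold is illustrative): refine in depth
    when diversity is low; sample in breadth when diversity is high."""
    return diversity_profile(approach_labels) < threshold
```

For example, a model whose 10 samples nearly all follow one approach scores low and would be routed to depth refinement, while a model spreading samples evenly across four approaches scores 1.0 and would favor breadth.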
Anonymization: This submission has been anonymized for double-blind review via the removal of identifying information such as names, affiliations, and identifying URLs.
Submission Number: 193