First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning

Published: 01 Jan 2024, Last Modified: 20 May 2025EMNLP 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Explicit multi-step reasoning, such as chain-of-thought, is widely adopted in the community to explore the better performance of language models (LMs). We report on the systematic strategy that LMs use in this process.Our controlled experiments reveal that LMs rely more heavily on heuristics, such as lexical overlap, in the earlier stages of reasoning when more steps are required to reach an answer. Conversely, their reliance on heuristics decreases as LMs progress closer to the final answer. This suggests that LMs track only a limited number of future steps and dynamically combine heuristic strategies with rational ones in solving tasks involving multi-step reasoning.
Loading