Breaking the Reasoning Barrier A Survey on LLM Complex Reasoning through the Lens of Self-Evolution

Published: 30 Jun 2025, Last Modified: 28 Jul 2025Findings of the Association for Computational Linguistics: ACL 2025EveryoneCC BY 4.0
Abstract: The release of OpenAI’s O1 and subsequent projects like DeepSeek R1 has significantly advanced research on complex reasoning in LLMs. This paper systematically analyzes existing reasoning studies from the perspective of self-evolution, structured into three components: data evolution, model evolution, and self-evolution. Data evolution explores methods to generate higher-quality reasoning training data. Model evolution focuses on training strategies to boost reasoning capabilities. Self-evolution research autonomous system evolution via iterating cycles of data and model evolution. We further discuss the scaling law of self-evolution and analyze representative O1-like works through this lens. By summarizing advanced methods and outlining future directions, this paper aims to drive advancements in LLMs’ reasoning abilities.
Loading