INFORM : Information eNtropy based multi-step reasoning FOR large language Models

Chuyue Zhou; WangJie You; Juntao Li; Jing Ye; Kehai Chen; Min Zhang

INFORM : Information eNtropy based multi-step reasoning FOR large language Models

Chuyue Zhou, WangJie You, Juntao Li, Jing Ye, Kehai Chen, Min Zhang

Published: 07 Oct 2023, Last Modified: 01 Dec 2023EMNLP 2023 MainEveryoneRevisionsBibTeX

Submission Type: Regular Long Paper

Submission Track: Theme Track: Large Language Models and the Future of NLP

Submission Track 2: Language Modeling and Analysis of Language Models

Keywords: Chain-of-Thoughts, Multi-Step Reasoning, Large Language Models, Prompting, In-context Learning

Abstract: Large language models (LLMs) have demonstrated exceptional performance in reasoning tasks with dedicated Chain-of-Thought (CoT) prompts. Further enhancing CoT prompts with exquisite exemplars can significantly improve reasoning performance.However, the effectiveness of CoT prompts may fluctuate dramatically with different choices of in-context examples. Additionally, manual construction of rationale steps can be time-consuming, presenting challenges for the widespread adoption of CoT prompting. In this work, we propose a novel approach by introducing information entropy (IE) as a criteria on for CoT prompt selection. We extend this criterion to the CoT generation and inference stages, automatically generating CoT prompts with higher information entropy scores and adaptively determining the number of samples. These three stages together form our proposed information- entropy-based multi-step reasoning for large language models, named INFORM. Our experiments across seven reasoning benchmarks utilizing two language models(GPT-3.5-Turbo and text-davinci-003) demonstrate the superiority of INFORM both in performance and efficiency.

Submission Number: 5570

Loading