Feedback to Reasoning: LLM-Assisted Molecular Optimization with Domain Feedback and Historical Reasoning

Feedback to Reasoning: LLM-Assisted Molecular Optimization with Domain Feedback and Historical Reasoning

ACL ARR 2026 January Submission2865 Authors

03 Jan 2026 (modified: 20 Mar 2026)ACL ARR 2026 January SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Keywords: LLM for Science; Molecular Optimization

Abstract: The success of large language models (LLMs) across domains highlights their potential in scientific tasks, with molecular optimization being a promising frontier. Traditionally, this optimization relies on iterative expert feedback to refine molecules toward desired properties, a process well aligned with LLMs’ strengths. **As an experience-driven task, molecular optimization depends critically on the domain feedback and accumulation of historical knowledge. However, none of the existing methods fully leverages such feedback and historical knowledge with reasoning traces and chemical insights.** In this work, we propose ***F2R***: Feedback to Reasoning, a conversational molecular optimization pipeline that enables LLMs to accumulate and retrieve past actions, rationales, and feedback. Like humans, LLMs can generate imperfect reasoning; F2R is the first framework to use detailed domain feedback to critique and improve this reasoning. This transforms LLMs from passive text generators into agentic experts that learn both actions and reasoning from experience. Consequently, F2R shows remarkable performance.

Paper Type: Long

Research Area: NLP Applications

Research Area Keywords: Clinical NLP

Languages Studied: English

Submission Number: 2865

Loading