Keywords: CO₂ reduction, Catalyst discovery, Large language models (LLMs)
Abstract: The climate crisis demands revolutionary catalysts for CO$\_2$ reduction, yet materials discovery remains bottlenecked by 10--20 year development cycles requiring deep domain expertise. This paper demonstrates how large language models can significantly accelerate and enhance the catalyst discovery process by assisting researchers in exploring vast chemical spaces and interpreting complex results when augmented with retrieval-based grounding. We introduce a retrieval-augmented generation framework that enables GPT-4 to navigate chemical space by accessing a database of 50,000+ known materials, transforming general-purpose language understanding into a powerful tool for high-throughput materials design. Our approach generated over 250 catalyst candidates with an unprecedented 82\% thermodynamic stability rate while addressing multi-objective constraints: 68\% achieved $<100\text{/kg}$ cost with metallic conductivity (band gap $<0.1\~\text{eV})$ and mechanical stability ($B/G > 1.75$). The best-performing $\text{Fe}\_{0.2}\text{Co}\_{0.2}\text{Ni}\_{0.2}\text{Ir}\_{0.1}\text{Ru}\_{0.3}$ achieves 0.285~V limiting potential (25\% improvement over $\text{IrO}\_2$), while $\text{Cr}\_{0.2}\text{Fe}\_{0.2}\text{Co}\_{0.3}\text{Ni}\_{0.2}\text{Mo}\_{0.1}$ optimally balances performance-cost trade-offs at $18\text{/kg}$. Volcano plot analysis confirms that 78\% of LLM-generated catalysts cluster near the theoretical activity optimum, while our system achieves $200 \times$ computational efficiency compared to traditional high-throughput screening. By demonstrating that retrieval-augmented generation can ground AI creativity in physical constraints without sacrificing exploration, this work establishes a new paradigm where natural language interfaces streamline materials discovery workflows, enabling researchers to explore chemical spaces more efficiently while the LLM assists in result interpretation and hypothesis generation.
Supplementary Material: zip
Submission Number: 295
Loading