Evaluating Extrapolation Ability of Large Language Model in Chemical Domain

Published: 06 Jul 2024, Last Modified: 28 Jul 2024
Venue: Language and Molecules ACL 2024 Poster
License: CC BY 4.0
Keywords: Chemistry, LLM, Extrapolation
TL;DR: LLMs can extrapolate to new, unseen materials using the chemical knowledge learned through massive pre-training.
Abstract: Solving problems outside the training space, i.e., extrapolation, has been a long-standing challenge in the machine learning community. The recent success of large language models (LLMs) demonstrates their ability to extrapolate to several unseen tasks. In line with this work, we evaluate the extrapolation ability of LLMs in the chemical domain. We construct a dataset of material properties of epoxy polymers measured across various raw materials and curing processes. The LLM must use its chemical knowledge to predict a material property when a novel raw material is introduced. Our experiments show that the LLM tends to choose the right direction of adjustment but fails to determine the exact degree, resulting in a poor mean absolute error (MAE) on some properties. However, the LLM can successfully adjust the degree given only a one-shot example. The results show that LLMs can extrapolate to new, unseen materials using the chemical knowledge learned through massive pre-training.
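The abstract separates two kinds of error: getting the direction of a property change right, and getting its magnitude right. The Python sketch below illustrates how the two could be scored separately. It is not the authors' evaluation code; the function names, the idea of comparing against a known baseline formulation, and all numbers are hypothetical and chosen only for illustration.

```python
def mean_absolute_error(y_true, y_pred):
    """Average absolute gap between measured and predicted property values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def direction_accuracy(baseline, y_true, y_pred):
    """Fraction of samples where the prediction moves away from the baseline
    in the same direction (up, down, or unchanged) as the measured value.

    Assumption: each sample has a known baseline value, e.g. the property of
    a formulation before the novel raw material is swapped in.
    """
    def sign(x):
        return (x > 0) - (x < 0)

    hits = sum(
        sign(t - b) == sign(p - b)
        for b, t, p in zip(baseline, y_true, y_pred)
    )
    return hits / len(y_true)


# Hypothetical property values, not data from the paper.
baseline = [100.0, 100.0, 100.0, 100.0]
measured = [120.0, 90.0, 130.0, 80.0]
predicted = [105.0, 98.0, 101.0, 110.0]

mae = mean_absolute_error(measured, predicted)
acc = direction_accuracy(baseline, measured, predicted)
print(f"MAE = {mae}, direction accuracy = {acc}")
```

In this toy case three of the four predictions move in the correct direction, yet the MAE stays large because the size of each adjustment is underestimated. This mirrors the pattern the abstract describes.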
Archival Option: The authors of this submission want it to appear in the archival proceedings.
Submission Number: 3