Retrieval of synthesis parameters of polymer nanocomposites using LLMs

Published: 11 Dec 2023, Last Modified: 11 Dec 2023AI4Mat-2023 PosterEveryoneRevisionsBibTeX
Submission Track: Findings
Submission Category: All of the above
Keywords: LLMs, information extraction, synthesis parameters
TL;DR: We developed strategies using large language models to rapidly extract detailed materials data from publications to populate a materials database, and demonstrated initial proof of concept on polymer nanocomposites.
Abstract: Automated materials synthesis requires historical data, but extracting detailed data and metadata from publications is challenging. We developed initial strategies for using large language models for rapid, autonomous data extraction from materials science articles in a format curatable by a materials database. We used the sub-domain of polymer nanocomposites as our example use case and demonstrated a proof of concept case study via manual validation. We used Claude 2 chat, Open AI GPT-3.5, and 4 API to extract characterization methods and general information about the samples, utilizing zero and few-shot prompting to elicit more detailed and accurate responses. We achieved the best results with an F1 score of 0.88 in the sample extraction task, using Claude 2 chat. Our findings demonstrate the utility of language models for more effective and practical retrieval of synthesis parameters from literature.
Submission Number: 80
Loading