PCoAD: The Pharmaceutical Certificate of Analysis Dataset Based on LLMs Validation and Calibration

PCoAD: The Pharmaceutical Certificate of Analysis Dataset Based on LLMs Validation and Calibration

ACL ARR 2026 January Submission2457 Authors

03 Jan 2026 (modified: 20 Mar 2026)ACL ARR 2026 January SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Large language mode, Agent, Adaptive Retrieval, Dataset, Pharmaceutical inspection

Abstract: Artificial intelligence is now widely applied in drug discovery and development, accelerating the entry of novel pharmaceuticals into clinical practice. As the variety of drugs expands, the workload for pharmaceutical inspection has increased significantly, making the demand for automated verification of pharmaceutical inspection results increasingly urgent. However, current research lacks reliable evaluation methods and datasets. To address this issue, we construct a Pharmaceutical Certificate of Analysis Dataset (PCoAD) based on large language model validation and correction. This dataset comprises 4,272 manually verified pharmaceutical certificates of analysis (PCoAs) based on Chinese and U.S. pharmacopoeias, comprehensively testing the ability of model to verify pharmaceutical inspection processes. Based on PCoAD, we propose the Multi-Agent Cooperation based on Adaptive Retrieval (MACAR) framework. This framework employs text chunking, adaptive retrieval, and inference to validate PCoAs. Experimental results demonstrate that MACAR outperforms multiple state-of-the-art methods across various types of tasks.

Paper Type: Long

Research Area: Clinical and Biomedical Applications

Research Area Keywords: LLM/AI agents, retrieval-augmented generation, logical reasoning, healthcare applications

Contribution Types: Model analysis & interpretability, NLP engineering experiment, Data resources

Languages Studied: English, Chinese

Submission Number: 2457

Loading