LLM-Driven Lab Result Extraction from Electronic Health Records

Published: 19 Aug 2025, Last Modified: 12 Oct 2025BHI 2025EveryoneRevisionsBibTeXCC BY 4.0
Confirmation: I have read and agree with the IEEE BHI 2025 conference submission's policy on behalf of myself and my co-authors.
Keywords: Electronic Health Records, Natural Language Processing, Large Language Models, Structured Data Extraction
TL;DR: We trained and tested multiple large language models for extracting structured spirometry data from unstructured clinical notes. The best-performing model achieved high accuracy
Abstract: Extracting structured lab results from electronic health records (EHRs) is critical for large-scale clinical research and patient care but remains challenging due to the unstructured nature of EHR data. This study evaluates large language models (LLMs)—including Phi4, LLaMa-3, and Qwen2.5—across zero-shot, one-shot, few-shot, and fully fine-tuned settings to assess their ability to extract key lab values from clinical text. We analyze prompt design, adaptation strategies, and the effects of quantization on speed and accuracy. LLaMa-3 (8B) with full fine-tuning achieved the highest accuracy (93.79%), while LLaMa-3.3 (70B) excelled in few-shot mode (89.56%). Incorporating additional spirometry data further improved asthma severity classification accuracy from 0.72 to 0.85. These findings demonstrate that well-designed prompts and fine-tuning enable efficient and accurate LLM-based data extraction, with clear trade-offs between model size, speed, and performance.
Track: 6. Theoretical Biomedical Informatics
Registration Id: LDNXKQD2SVW
Submission Number: 236
Loading