Abstract: Highlights
• Developed an LLM-based NER pipeline to extract nuanced patient language status from clinical notes.
• Cross-site validation (YNHH, MIMIC) showed strong zero-shot accuracy for GPT-4o and robust generalization for LLaMA3.
• Enables scalable, fine-grained language data extraction to support equitable, language-focused healthcare research.
External IDs: dblp:journals/ijmi/QianHZXWCDLMBLQKADZX26