Chinese Medical Named Entity Recognition Using External Knowledge

Published: 01 Jan 2022, Last Modified: 06 Aug 2024PRICAI (2) 2022EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Chinese medical named entity recognition (NER) task usually lacks sufficient annotation data, and it contains many medical professional terms and abbreviations, making the NER task more difficult. In addition, compared with English NER, Chinese NER is more challenging because it lacks standard feature symbols to determine named entity boundaries. Therefore, Chinese NER needs to perform word segmentation. In this paper, we are inspired by lexicon-based BERT and propose a novel method for Chinese medical NER task. Besides, We design a template-based strategy to enrich the words’ information and improve the model’s ability to distinguish medical professional terms and abbreviations. Our method enhances the word segmentation accuracy by introducing the external medical lexicon. To verify the effectiveness of our method, we carry out experiments on three medical datasets and our method improves them by 0.92%, 1.18% and 1.55% F1-score compared to baseline.
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview