Thinking the Importance of Patient's Chief Complaint in TCM Syndrome Differentiation

Published: 01 Jan 2024, Last Modified: 12 Jan 2025CSCWD 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Traditional Chinese Medicine (TCM) is a natural, safe, and effective therapeutic approach with widespread application worldwide. The unique diagnostic methods of TCM often require a comprehensive analysis of patient information, much of which is contained in clinical text. Numerous studies have demonstrated the effectiveness of natural language processing (NLP) techniques in TCM disease classification. Therefore, this paper focuses on the task of TCM syndrome differentiation, proposing a novel matching score calculation method and a new label attention calculation method to assist the model in focusing on the relationship between TCM syndrome and disease symptom. Specifically, we enhance the model’s attention to the uniqueness of the relationship between TCM syndrome and symptom by introducing a finer-grained token-level matching score. Simultaneously, we improve the model’s attention to the generality of the relationship between TCM syndrome and symptom through a more global label attention mechanism. Additionally, we observe a severe long-tail problem in the dataset. To alleviate this issue, we propose the use of focal loss to help the model pay more attention to challenging samples. Extensive experiments on the TCM-SD dataset indicate that our approach significantly outperforms state-of-the-art baselines 1 .
Loading