Dual Architecture for Name Entity Extraction and Relation Extraction with Applications in Medical Corpora


16 Oct 2021 (modified: 05 May 2023)
NeurIPS 2021 Workshop LatinX in AI Blind Submission
Readers: Everyone
Keywords: information extraction, natural language processing, recurrent neural networks
Abstract: There is growing interest in automatic knowledge discovery from plain-text documents, since automation enables the analysis of massive collections of information. Such efforts are especially relevant in the health domain, where advances could draw on the large volume of available resources to address health research challenges that matter to society. However, knowledge discovery usually relies on annotated corpora, which are scarce. This scarcity is particularly critical for Spanish, for which fewer training resources are available. This work takes an existing health-oriented Spanish dataset as its starting point and also creates an English variant using the same tagging system. Furthermore, we design and analyze two separate architectures, one for Entity Extraction and one for Relation Recognition, that outperform previous work on the Spanish dataset. Given these promising results, we also evaluate their performance on the English version.
