On the Assessment of Deep Learning Models for Named Entity Recognition of Brazilian Legal Documents

Published: 01 Jan 2023, Last Modified: 30 Sept 2024EPIA (2) 2023EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: A large amount of legal and legislative documents are generated every year with highly specialized content and significant repercussions on society. Besides technical, the produced information is not semantically standardized or format structured. Automating the document analysis, categorization, search, and summarization is essential. The Named Entity Recognition (NER) task is one of the tools that have the potential to extract information from legal documents with efficiency. This paper evaluates the state-of-the-art NER models BiLSTM+CRF and BERT+Fine-Tunning trained on Portuguese corpora through finetuning in the legal and legislative domains. The obtained results (F1-scores of 83.17% and 88.27%) suggest that the BERT model is superior, achieving better average results.
Loading