KGSAR: A Knowledge Graph-Based Tool for Managing Spanish Colonial Notary RecordsDownload PDFOpen Website

28 Mar 2023OpenReview Archive Direct UploadReaders: Everyone
Abstract: Notary records contain abundant information relevant to historical inquiry but are in physical form and hence, searching for information in these documents could be painstaking. In this demo paper, we present a document retrieval system that allows users to search for a keyword in digitized copies of physical records. The system uses cleaned and denoised images to search a keyword using optical char- acter recognition (OCR) models re-trained on labeled data provided by experts. The word predictions and bounding boxes are stored as a knowledge graph (KG). A keyword query is then mapped to a graph query on the KG. The results are ranked based on text matching. An intuitive user interface (UI) allows a user to search, correct, delete or draw more annotations that are used for retraining of the OCR models.
0 Replies

Loading