- Keywords: knowledge graph, semantics-aware
- TL;DR: An approach to clean noisy extraction considering an IE system as a blackbox
- Abstract: KG-Cleaner is a semantics-aware framework combining two different approaches for knowledge correction: text-based classiﬁcation to identify, and schema backed classiﬁcation to correct, errors in data produced by information extraction systems. The approach is novel in being an independent, standalone system addressing both tasks in a uniﬁed, joint manner. We evaluate KG-Cleaner and other models on two collections: a Wikidata corpus of 700K facts and 5M fact-relevant sentences and a collection of 30K facts extracted by systems participating in the 2015 TAC Knowledge Base Population task. We ﬁnd that simple parameter-efﬁcient shallow neural networks, combined with a continuous relaxation of a discrete predicted latent variable, provide a good common representation for the two tasks, achieving absolute performance gains of 30-35 F1 points on the evaluation datasets for both credibility prediction and relation repair.
- Archival status: Archival
- Subject areas: Information Extraction