[Novel] Conflations and duplications in Wikidata items: causes, detection, solutions, and issues

Published: 29 Aug 2023, Last Modified: 10 Oct 2023, ISWC 2023 Workshop Wikidata Submission
Abstract: This paper analyzes the problem of incorrectly identified entities in Wikidata items, both in general and with a focus on items about humans. Incorrect identification is divided into two types: conflations and duplications. The paper then covers the causes of conflations and duplications, the methods available for detecting them, the solutions applicable to them, and the issues that obstruct those solutions. Finally, three proposals are made to mitigate these issues.
Submission Number: 6