Constructing a Knowledge Graph from Open Statistical Data: The Case of Nova Scotia DiseaseDatasetsDownload PDF

06 Mar 2022 (modified: 23 May 2023)Submitted to KGCW 2022Readers: Everyone
Keywords: Open statistical data, Nova Scotia, Knowledge graph, Disease dataset
TL;DR: In this paper, we discuss a knowledge graph process and lessons learned during the knowledge graph consturction process.
Abstract: The majority of available datasets in open government data are statistical. They are widely published by different governments to be used by the public and data consumers. However, most datasets in open data portals are not provided in RDF format. Moreover, the datasets are isolated from one another while conceptually connected. Through this paper, a knowledge graph is constructed for the disease-related datasets of a Canadian government data portal, Nova Scotia Open Data. We transformed all the disease-related datasets to RDF according to the Semantic Web standards and enriched them by semantic rules and an external ontology. The ontology designed to develop the graph adheres to best practices and standards, allowing for expansion, modification and flexible re-use (https://zenodo.org/record/5517236#.Ye_MsfXMJb8). The study also discusses the lessons learned during the cross-dimensional knowledge graph construction and integrating open statistical datasets from multiple sources.
5 Replies

Loading