A Framework Based on Data Augmentation for Knowledge Graph Entity Typing

Zepeng Li, Minyu Zhai, Xiao Li, Rikui Huang, Chenhui Liang, Bin Hu

Published: 2025, Last Modified: 27 Jul 2025ICASSP 2025EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: The task of knowledge graph entity typing (KGET) aims to infer the missing types for entities in knowledge graphs, which is a significant subtask of knowledge graph completion (KGC). In despite of its progress, we observe that the sparsity of the dataset greatly affects the task itself as well as downstream tasks. In this paper, we propose a framework to alleviate this problem, which consists of data augmentation and type inference. We introduce a Statistics-based Entity Type Data Augmentation (SET-DA) method in data augmentation phase, which calculates a type probability distribution for each entity by statistically determining the global relation-type statistics information, and then employ embedding-based KGET models for type inference. Experimental results on two widely used datasets indicate that the proposed framework can solve the data sparsity problem in the task of KGET and the performance of models trained with data augmentation by SET-DA significantly outperforms previous state-of-the-art methods.