Abstract: Knowledge graph embedding (KGE) models encode the structural information of knowledge graphs to predicting new links.
Effective training of these models requires distinguishing between positive and negative samples with high precision.
Although prior research has shown that improving the quality of negative samples can significantly enhance model accuracy, identifying high-quality negative samples remains a challenging problem.
This paper theoretically investigates the condition under which negative samples lead to optimal KG embedding and identifies a sufficient condition for an effective negative sample distribution. Based on this theoretical foundation, we propose \textbf{E}mbedding \textbf{MU}tation (\textsc{EMU}), a novel framework that \emph{generates} negative samples satisfying this condition, in contrast to conventional methods that focus on \emph{identifying} challenging negative samples within the training data.
Importantly, the simplicity of \textsc{EMU} ensures seamless integration with existing KGE models and negative sampling methods.
To evaluate its efficacy, we conducted comprehensive experiments across multiple datasets. The results consistently demonstrate significant improvements in link prediction performance across various KGE models and negative sampling methods. Notably, \textsc{EMU} enables performance improvements comparable to those achieved by models with embedding dimension five times larger.
An implementation of the method and experiments are available at \url{https://github.com/nec-research/EMU-KG}.
Submission Length: Regular submission (no more than 12 pages of main content)
Changes Since Last Submission: (The following is the same post that the authors provided as a response to the action editor's decision on March 24th.)
W1: We have added a discussion in Footnote 5 on the effectiveness of EMU on WN18RR to better support our approach on this dataset.
W2: We have included guidance on selecting the hyper-parameter of EMU in the final section of Appendix K, titled “Hyper-Parameter Dependence Study.”
The modified and newly added sections are highlighted in red for clarity. Upon acceptance, we will prepare the unanonymized camera-ready version. We hope these revisions sufficiently address your concerns and that our paper is now suitable for acceptance.
Code: https://github.com/nec-research/EMU-KG
Assigned Action Editor: ~quanming_yao1
Submission Number: 3959
Loading