Enhancing Large-Scale Entity Alignment with Critical Structure and High-Quality Context

Published: 2025, Last Modified: 16 Jan 2026ICDE 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Entity Alignment (EA) aims to identify equivalent entities across multiple Knowledge Graphs (KGs). However, when applied to larger-scale KGs, most existing EA approaches suffer from the scalability issue due to excessive GPU memory and time consumption. To mitigate this, recent advances have introduced the Large-scale EA (LsEA) task, which divides large-scale KG pairs into smaller sub-graph pairs. Despite their promising results, several notable challenges remain, preventing these advances from achieving optimal performance: 1) How to effectively utilize critical structures when generating sub-tasks? 2) How to supplement high-quality context to enhance LsEA performance? 3) How to address scenarios without alignment seeds? To tackle these challenges, we propose a novel method called ELsEA. It comprises three main components: (1) Source and Target Graph Partition, using a Metis-based weighted partitioner and a counter-part candidate generator to partition source and target graphs respectively, aiming to utilize critical structures effectively; (2) Supplement High-quality Context, which utilizes a value-based informativeness-evaluation module and a neighbor enrichment module to assess each entity's informativeness effectively, then supplement high-quality context based on this informativeness; and (3) Seed-free Setup, introducing a mixed-info pseudo-seed generation strategy to mitigate name bias, generating accurate pseudo-seeds when alignment seeds are unavailable. Extensive experiments demonstrate that ELsEA outperforms state-of-the-art baselines. The code of ELsEA is available online11https://githuh.com/wx-qzhou/ELsEA.git.
Loading