Abstract: Entity resolution (ER) is the problem of identifying and linking database records that refer to the same real-world entity. Traditional ER methods use batch processing, which becomes impractical with growing data volumes due to high computational costs and a lack of real-time capabilities. In many applications, users need to resolve entities for only a small portion of their data, making full data processing unnecessary—a scenario known as “ER-on-demand”. This paper proposes FastER, an efficient ER-on-demand framework for property graphs. Our approach uses graph differential dependencies (GDD) as a knowledge-encoding language to design effective filtering mechanisms that leverage both the structural and attribute semantics of graphs. We construct a blocking graph from the filtered subgraphs to reduce the number of candidate entity pairs that require comparison. Additionally, FastER incorporates Progressive Profile Scheduling (PPS), allowing the system to incrementally produce results throughout the resolution process. Extensive evaluations on multiple benchmark datasets demonstrate that FastER significantly outperforms state-of-the-art ER methods in computational efficiency and real-time processing for on-demand tasks, without compromising quality or reliability. We make FastER publicly available at the Github link here.
External IDs:dblp:conf/semweb/WangZMKBHNF25
Loading