GSM: A generalized approach to Supervised Meta-blocking for scalable entity resolution

Published: 01 Jan 2024, Last Modified: 31 Jul 2024Inf. Syst. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Formalization of meta-blocking as a probabilistic classification task.•A supervised meta-blocking algorithm that requires only 50 examples for training.•Four new weighting schemes that enhance the meta-blocking performance.•Extensive experimental evaluation on 9 real-world datasets and 5 synthetic ones.•Comparison with state-of-the-art recently published blocking solutions.
Loading