Keywords: Clustering, Benchmark
Abstract: Clustering is a fundamental problem in data science with a long-standing research history. Over the past few decades, numerous clustering algorithms have been developed. However, a systematic and experimental evaluation of these algorithms remains lacking and is urgently needed. To address this gap, we introduce CLUBench, a comprehensive clustering benchmark comprising 23 algorithms of diverse principles evaluated on 131 datasets across tabular, text and image data types. Our extensive experiments (174,485) yield statistically meaningful insights into the performance of various clustering methods, such as the impact of hyperparameter tuning, similarity between algorithms, and the impact of data type and dimension.
Notably, we observe low-rank characteristics in cross-model performance matrices, which facilitates an efficient strategy for rapid algorithm evaluation and selection in practical applications. Additionally, we provide an easy-to-use toolbox by encapsulating the source codes from the official code repository into a unified framework, accompanied by detailed instructions. With CLUBench, researchers and practitioners can efficiently select appropriate algorithms or datasets for evaluating new datasets or proposed methods. All benchmark datasets and the toolbox are fully open-sourced and available at https://anonymous.4open.science/r/CLUBench-ICLR2026/.
Primary Area: datasets and benchmarks
Submission Number: 4099
Loading