Significance-based decision tree for interpretable categorical data clustering

Published: 01 Jan 2025, Last Modified: 20 Jul 2025Inf. Sci. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•A testing-based decision tree, SigDT, is presented for clustering categorical data.•The split point evaluation issue is formulated as a multiple testing problem.•SigDT conducts clusterability prediction and cluster analysis simultaneously.•SigDT determines the number of clusters automatically via significance testing.
Loading