The Density Connectivity Information Bottleneck

2008 (modified: 09 Nov 2022), ICYCS 2008
Abstract: Clustering with the agglomerative information bottleneck (aIB) algorithm suffers from sub-optimality: its greedy merging cannot guarantee that as much relevant information as possible is preserved. To address this problem, we introduce a density connectivity chain, through which we consider not only the information shared between two data elements but also the information shared among the neighbors of each data element. Based on this idea, we propose DCIB, a density connectivity information bottleneck algorithm that applies the information bottleneck method to quantify the relevant information during the clustering procedure. As a hierarchical algorithm, DCIB produces a pruned clustering tree and yields clusterings of different sizes in a single execution. Experimental results on document clustering indicate that DCIB preserves more relevant information and achieves higher precision than the aIB algorithm.
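
For orientation, the aIB baseline that the abstract compares against can be sketched as follows. This is a minimal illustration of the standard greedy aIB merge rule (merge the pair of clusters whose union loses the least information, measured as a weighted Jensen-Shannon divergence), not the DCIB algorithm itself; the abstract does not specify how the density connectivity chain is built, so it is not attempted here. The function names `kl`, `merge_cost`, and `agglomerative_ib` and the toy inputs are assumptions made for illustration.

```python
import math


def kl(p, q):
    """KL divergence D(p || q) in bits; assumes q[k] > 0 wherever p[k] > 0."""
    return sum(pk * math.log2(pk / qk) for pk, qk in zip(p, q) if pk > 0)


def merge_cost(w_i, p_i, w_j, p_j):
    """Information lost by merging two clusters.

    w_i, w_j are cluster priors p(c); p_i, p_j are conditionals p(y | c).
    The cost is (w_i + w_j) times the Jensen-Shannon divergence of the two
    conditionals, weighted by the clusters' relative priors.
    """
    w = w_i + w_j
    pi_i, pi_j = w_i / w, w_j / w
    mix = [pi_i * a + pi_j * b for a, b in zip(p_i, p_j)]
    return w * (pi_i * kl(p_i, mix) + pi_j * kl(p_j, mix))


def agglomerative_ib(weights, conditionals, n_clusters):
    """Greedy aIB: start from singletons and repeatedly merge the cheapest pair."""
    clusters = [([i], w, list(p))
                for i, (w, p) in enumerate(zip(weights, conditionals))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                cost = merge_cost(clusters[a][1], clusters[a][2],
                                  clusters[b][1], clusters[b][2])
                if best is None or cost < best[0]:
                    best = (cost, a, b)
        _, a, b = best
        members_a, w_a, p_a = clusters[a]
        members_b, w_b, p_b = clusters[b]
        w = w_a + w_b
        # The merged conditional is the prior-weighted average of the two.
        merged_p = [(w_a * x + w_b * y) / w for x, y in zip(p_a, p_b)]
        clusters[a] = (members_a + members_b, w, merged_p)
        del clusters[b]
    return [members for members, _, _ in clusters]


# Toy example (hypothetical data): four documents over a two-word vocabulary.
weights = [0.25, 0.25, 0.25, 0.25]
conditionals = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
result = agglomerative_ib(weights, conditionals, 2)
```

Because each step commits to the locally cheapest merge and never revisits it, an early merge can rule out a better overall partition, which is the sub-optimality the abstract refers to.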