Distributed Clustering Based on Sampling Local Density Estimates

2003 (modified: 16 Jul 2019) | IJCAI 2003 | Readers: Everyone
Abstract: Huge amounts of data are stored in autonomous, geographically distributed sources. The discovery of previously unknown, implicit and valuable knowledge is a key aspect of the exploitation of such sources. In recent years several approaches to knowledge discovery and data mining, and in particular to clustering, have been developed, but only a few of them are designed for distributed data sources. We propose a novel distributed clustering algorithm based on non-parametric kernel density estimation, which takes into account the issues of privacy and communication costs that arise in a distributed environment.
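
The abstract does not spell out the algorithm, so the following is only a minimal sketch of one property that makes kernel density estimation attractive for distributed data: kernel sums are additive, so each site can sample its local estimate at agreed locations and share only those samples instead of raw points. The Gaussian kernel, the shared one-dimensional grid, the bandwidth `h`, the two example sites, and all function names are assumptions made for illustration and are not taken from the paper.

```python
import math

def gaussian_kernel(u):
    """Standard Gaussian kernel (an assumed choice of kernel)."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def local_density_sum(points, grid, bandwidth):
    """Unnormalized kernel sum of one site's points, sampled at each grid location."""
    return [
        sum(gaussian_kernel((x - p) / bandwidth) for p in points)
        for x in grid
    ]

def global_density(local_samples, total_points, bandwidth):
    """Add the per-site samples pointwise and normalize into a density estimate."""
    summed = [sum(values) for values in zip(*local_samples)]
    return [s / (total_points * bandwidth) for s in summed]

# Hypothetical data held at two separate sites.
site_a = [1.0, 1.2, 0.8]
site_b = [5.0, 5.1]

# Sampling locations and bandwidth that all sites are assumed to agree on.
grid = [i * 0.5 for i in range(13)]  # 0.0, 0.5, ..., 6.0
h = 0.5

# Each site transmits only its sampled kernel sums, never its raw points.
samples = [local_density_sum(site, grid, h) for site in (site_a, site_b)]
estimate = global_density(samples, len(site_a) + len(site_b), h)
```

Under these assumptions, `estimate` matches the kernel density estimate that would be obtained by pooling both sites' points at one location, while the amount of data transmitted depends on the grid size rather than on the number of points.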