Sublinear time approximate clusteringOpen Website

Published: 2001, Last Modified: 12 May 2023SODA 2001Readers: Everyone
Abstract: Clustering is of central importance in a number of disciplines including Machine Learning, Statistics, and Data Mining. This paper has two foci: (1) It describes how existing algorithms for clustering can benefit from simple sampling techniques arising from work in statistics [Pol84]. (2) It motivates and introduces a new model of clustering that is in the spirit of the “PAC (probably approximately correct)” learning model, and gives examples of efficient PAC-clustering algorithms.
0 Replies

Loading