Abstract

We motivate and develop a natural bicriteria measure for assessing the quality of a clustering that avoids the drawbacks of existing measures. A simple recursive heuristic is shown to have poly-logarithmic worst-case guarantees under the new measure. The main result of the article is the analysis of a popular spectral algorithm. One variant of spectral clustering turns out to have effective worst-case guarantees; another finds a "good" clustering, if one exists.

Keywords

Cluster analysisHeuristicLogarithmMeasure (data warehouse)Computer scienceSimple (philosophy)Spectral clusteringQuality (philosophy)AlgorithmData miningMathematical optimizationMathematicsArtificial intelligence

Affiliated Institutions

Related Publications

Publication Info

Year
2004
Type
article
Volume
51
Issue
3
Pages
497-515
Citations
842
Access
Closed

External Links

Social Impact

Altmetric
PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

842
OpenAlex

Cite This

Ravi Kannan, Santosh Vempala, Adrian Vetta (2004). On clusterings. Journal of the ACM , 51 (3) , 497-515. https://doi.org/10.1145/990308.990313

Identifiers

DOI
10.1145/990308.990313