Abstract

The k-means algorithm is generally the most known and used clustering method. There are various extensions of k-means to be proposed in the literature. Although it is an unsupervised learning to clustering in pattern recognition and machine learning, the k-means algorithm and its extensions are always influenced by initializations with a necessary number of clusters a priori. That is, the k-means algorithm is not exactly an unsupervised clustering method. In this paper, we construct an unsupervised learning schema for the k-means algorithm so that it is free of initializations without parameter selection and can also simultaneously find an optimal number of clusters. That is, we propose a novel unsupervised k-means (U-k-means) clustering algorithm with automatically finding an optimal number of clusters without giving any initialization and parameter selection. The computational complexity of the proposed U-k-means clustering algorithm is also analyzed. Comparisons between the proposed U-k-means and other existing methods are made. Experimental results and comparisons actually demonstrate these good aspects of the proposed U-k-means clustering algorithm.

Keywords

Computer scienceCluster analysisArtificial intelligenceCanopy clustering algorithmUnsupervised learningPattern recognition (psychology)Correlation clusteringAlgorithm

Affiliated Institutions

Related Publications

Publication Info

Year
2020
Type
article
Volume
8
Pages
80716-80727
Citations
1917
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1917
OpenAlex
35
Influential
1658
CrossRef

Cite This

Kristina P. Sinaga, Miin‐Shen Yang (2020). Unsupervised K-Means Clustering Algorithm. IEEE Access , 8 , 80716-80727. https://doi.org/10.1109/access.2020.2988796

Identifiers

DOI
10.1109/access.2020.2988796

Data Quality

Data completeness: 86%