Deep Clustering for Unsupervised Learning of Visual Features

Abstract

Clustering is a class of unsupervised learning methods that has been extensively applied and studied in computer vision. Little work has been done to adapt it to the end-to-end training of visual features on large-scale datasets. In this work, we present DeepCluster, a clustering method that jointly learns the parameters of a neural network and the cluster assignments of the resulting features. DeepCluster iteratively groups the features with a standard clustering algorithm, k-means, and uses the subsequent assignments as supervision to update the weights of the network. We apply DeepCluster to the unsupervised training of convolutional neural networks on large datasets like ImageNet and YFCC100M. The resulting model outperforms the current state of the art by a significant margin on all the standard benchmarks.

Keywords

Computer scienceCluster analysisMargin (machine learning)Artificial intelligenceUnsupervised learningPattern recognition (psychology)Convolutional neural networkArtificial neural networkMachine learningClass (philosophy)

Related Publications

Unsupervised Representation Learning with Deep Convolutional Generative\n Adversarial Networks

Alec Radford , Luke Metz , Soumith Chintala

In recent years, supervised learning with convolutional networks (CNNs) has\nseen huge adoption in computer vision applications. Comparatively, unsupervised\nlearning with CNNs ...

2015 arXiv (Cornell University) 7618 citations

Unsupervised Feature Learning via Non-parametric Instance Discrimination

Zhirong Wu , Yuanjun Xiong , Stella X. Yu +1 more

Neural net classifiers trained on data with annotated class labels can also capture apparent visual similarity among categories without being directed to do so. We study whether...

2018 3435 citations

Convolutional Pose Machines

Shih-En Wei , Varun Ramakrishna , Takeo Kanade +1 more

Pose Machines provide a sequential prediction framework for learning rich implicit spatial models. In this work we show a systematic design for how convolutional networks can be...

2016 2728 citations

Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks

Maxime Oquab , Léon Bottou , Ivan Laptev +1 more

Convolutional neural networks (CNN) have recently shown outstanding image classification performance in the large- scale visual recognition challenge (ILSVRC2012). The success o...

2014 3151 citations

CosFace: Large Margin Cosine Loss for Deep Face Recognition

Hao Wang , Yitong Wang , Zheng Zhou +5 more

Face recognition has made extraordinary progress owing to the advancement of deep convolutional neural networks (CNNs). The central task of face recognition, including face veri...

2018 2018 IEEE/CVF Conference on Computer ... 2715 citations

Publication Info

Year: 2018
Type: book-chapter
Pages: 139-156
Citations: 2355
Access: Closed

External Links

Download PDF (Free) View on DOI.org arXiv PubMed Semantic Scholar

Social Impact

Altmetric

Deep Clustering for Unsupervised Learning of Visual Features

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

2355

OpenAlex

229

Influential

Cite This

APA Style

                            
                                    Mathilde Caron, 
                                
                                    Piotr Bojanowski, 
                                
                                    Armand Joulin
                                
                                et al.
                            
                            (2018). 
                            Deep Clustering for Unsupervised Learning of Visual Features. 
                            Lecture notes in computer science
                            
                            , 139-156.
                            https://doi.org/10.1007/978-3-030-01264-9_9

Identifiers

DOI: 10.1007/978-3-030-01264-9_9
PMID: 41357951
PMCID: PMC12678682
arXiv: 1807.05520

Data Quality

Data completeness: 79%