Multi-Label Image Recognition With Graph Convolutional Networks

Abstract

The task of multi-label image recognition is to predict the set of object labels that are present in an image. As objects normally co-occur in an image, it is desirable to model the label dependencies to improve recognition performance. To capture and exploit these dependencies, we propose a multi-label classification model based on a Graph Convolutional Network (GCN). The model builds a directed graph over the object labels, where each node (label) is represented by the word embedding of that label, and the GCN is learned to map this label graph into a set of inter-dependent object classifiers. These classifiers are applied to the image descriptors extracted by another sub-net, so the whole network is end-to-end trainable. Furthermore, we propose a novel re-weighting scheme to create an effective label correlation matrix that guides information propagation among the nodes in the GCN. Experiments on two multi-label image recognition datasets show that our approach clearly outperforms existing state-of-the-art methods. In addition, visualization analyses reveal that the classifiers learned by our model maintain a meaningful semantic topology.
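
The pipeline described in the abstract can be illustrated with a minimal, standard-library Python sketch. It is not the authors' implementation: the two-layer depth, the leaky ReLU activation, the random weights, the toy 3-label graph, and all function names are assumptions made for illustration. In particular, the abstract does not spell out the re-weighting scheme, so plain row normalization with self-loops is swapped in as a stand-in for the label correlation matrix.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def row_normalize(adj):
    """Stand-in correlation matrix (assumption): add self-loops, make rows sum to 1."""
    n = len(adj)
    looped = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    return [[v / sum(row) for v in row] for row in looped]

def leaky_relu(m, slope=0.2):
    """Element-wise leaky ReLU (the choice of activation is an assumption)."""
    return [[v if v > 0 else slope * v for v in row] for row in m]

def gcn_layer(adj_norm, h, w, activate=True):
    """One graph convolution: H' = f(A_norm @ H @ W)."""
    out = matmul(matmul(adj_norm, h), w)
    return leaky_relu(out) if activate else out

def label_classifiers(adj, embeddings, w1, w2):
    """Map label embeddings to one classifier vector per label via two GCN layers."""
    a = row_normalize(adj)
    hidden = gcn_layer(a, embeddings, w1)
    return gcn_layer(a, hidden, w2, activate=False)

def score_image(classifiers, feature):
    """One score per label: dot product of its classifier with the image descriptor."""
    return [sum(c * f for c, f in zip(row, feature)) for row in classifiers]

def random_matrix(rows, cols, rng):
    """Random weights standing in for learned parameters or embeddings."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
num_labels, embed_dim, hidden_dim, feat_dim = 3, 4, 5, 6

# Hypothetical directed label graph: adj[i][j] = 1 if label j tends to
# appear when label i does (not symmetric, hence "directed").
adj = [[0, 1, 0],
       [0, 0, 1],
       [1, 1, 0]]

embeddings = random_matrix(num_labels, embed_dim, rng)  # stand-in word embeddings
w1 = random_matrix(embed_dim, hidden_dim, rng)
w2 = random_matrix(hidden_dim, feat_dim, rng)

classifiers = label_classifiers(adj, embeddings, w1, w2)  # num_labels x feat_dim
feature = [rng.uniform(0, 1) for _ in range(feat_dim)]    # stand-in image descriptor
scores = score_image(classifiers, feature)                # one logit per label
```

The point of the design is that the classifiers are outputs of the graph network rather than free parameters, so each label's classifier depends on the embeddings of its neighbours in the label graph. In a trained model the weights would be learned jointly with the image sub-net; here they are random and the scores carry no meaning.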

Keywords

Computer science, Pattern recognition (psychology), Artificial intelligence, Graph, Multi-label classification, Visualization, Cognitive neuroscience of visual object recognition, Convolutional neural network, Set (abstract data type), Contextual image classification, Image (mathematics), Feature extraction, Theoretical computer science

Related Publications

Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition

Lei Shi , Yifan Zhang , Jian Cheng +1 more

In skeleton-based action recognition, graph convolutional networks (GCNs), which model the human body skeletons as spatiotemporal graphs, have achieved remarkable performance. H...

2019 2019 IEEE/CVF Conference on Computer ... 1838 citations

Graph Convolutional Networks for Text Classification

Liang Yao , Chengsheng Mao , Yuan Luo

Text classification is an important and classical problem in natural language processing. There have been a number of studies that applied convolutional neural networks (convolu...

2019 Proceedings of the AAAI Conference on... 1867 citations

Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition

Maosen Li , Siheng Chen , Xu Chen +3 more

Action recognition with skeleton data has recently attracted much attention in computer vision. Previous studies are mostly based on fixed skeleton graphs, only capturing local ...

2019 1197 citations

Single-Shot Refinement Neural Network for Object Detection

Shifeng Zhang , Longyin Wen , Xiao Bian +2 more

For object detection, the two-stage approach (e.g., Faster R-CNN) has been achieving the highest accuracy, whereas the one-stage approach (e.g., SSD) has the advantage of high e...

2018 2018 IEEE/CVF Conference on Computer ... 1645 citations

Deeper Insights Into Graph Convolutional Networks for Semi-Supervised Learning

Qimai Li , Zhichao Han , Xiao-Ming Wu

Many interesting problems in machine learning are being revisited with new deep learning tools. For graph-based semi-supervised learning, a recent important development is graph...

2018 Proceedings of the AAAI Conference on... 2431 citations

Publication Info

Year: 2019
Type: article
Pages: 5172-5181
Citations: 1170
Access: Closed

Cite This

APA Style

Zhao-Min Chen, Xiu-Shen Wei, Peng Wang et al. (2019). Multi-Label Image Recognition With Graph Convolutional Networks. pp. 5172-5181. https://doi.org/10.1109/cvpr.2019.00532

Identifiers

DOI: 10.1109/cvpr.2019.00532