Abstract

Semi-supervised learning (SSL) methods aim to reduce the amount of labeled training data required by learning from both labeled and unlabeled instances. Macskassy and Provost (2007) proposed the weighted-vote relational neighbor classifier (wvRN) as a simple yet effective baseline for semi-supervised learning on network data; it resembles many recent graph-based SSL methods, has been shown to be essentially equivalent to the Gaussian-field harmonic functions classifier of Zhu et al. (2003), and performs very well on several benchmark network datasets. We describe another simple and intuitive semi-supervised learning method, based on random graph walks, that outperforms wvRN by a large margin on several benchmark datasets when very few labels are available. Additionally, we show that using authoritative instances as training seeds (instances that arguably cost much less to label) dramatically reduces the amount of labeled data required to achieve the same classification accuracy; for some existing state-of-the-art semi-supervised learning methods, the labeled data needed is reduced by a factor of 50.
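The abstract does not include an implementation, so the following is only a minimal Python sketch of a random-walk-with-restart classifier of the general kind described: each class's labeled instances seed a restart distribution, and every node is assigned the class whose walk gives it the highest score. The dense numpy adjacency matrix, function names, damping factor, convergence settings, and one-seed-per-class toy example are illustrative assumptions, not the authors' method or code.

    # Sketch: random-walk-with-restart (personalized PageRank) classification
    # over a network, seeded by a few labeled nodes per class. Illustrative only.
    import numpy as np

    def rwr_scores(A, seed_idx, damping=0.85, tol=1e-8, max_iter=1000):
        """Score every node by a random walk that restarts at the labeled seeds."""
        n = A.shape[0]
        # Column-normalize the adjacency matrix to get a transition matrix.
        deg = A.sum(axis=0)
        deg[deg == 0] = 1.0          # avoid division by zero for isolated nodes
        P = A / deg
        # Restart (teleport) vector: uniform over the labeled seed nodes.
        r = np.zeros(n)
        r[seed_idx] = 1.0 / len(seed_idx)
        v = r.copy()
        for _ in range(max_iter):    # power iteration until convergence
            v_next = damping * (P @ v) + (1 - damping) * r
            if np.abs(v_next - v).sum() < tol:
                return v_next
            v = v_next
        return v

    def classify(A, seeds_by_class):
        """Assign each node the class whose seeded walk gives it the highest score."""
        labels = list(seeds_by_class)
        scores = np.vstack([rwr_scores(A, seeds_by_class[c]) for c in labels])
        return [labels[i] for i in scores.argmax(axis=0)]

    # Toy example: two triangles joined by one edge, one labeled seed per class.
    A = np.array([
        [0, 1, 1, 0, 0, 0],
        [1, 0, 1, 0, 0, 0],
        [1, 1, 0, 1, 0, 0],
        [0, 0, 1, 0, 1, 1],
        [0, 0, 0, 1, 0, 1],
        [0, 0, 0, 1, 1, 0],
    ], dtype=float)
    print(classify(A, {"A": [0], "B": [5]}))

With more labeled seeds per class, the restart vector simply spreads its mass over them; the damping factor controls how far influence propagates from the seeds before restarting, which is the main tuning choice in a sketch like this.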

Keywords

Computer science, Artificial intelligence, Semi-supervised learning, Labeled data, Machine learning, Supervised learning, Classifier (UML), Margin (machine learning), Graph, Benchmark (surveying), Pattern recognition (psychology), Artificial neural network, Theoretical computer science

Publication Info

Year: 2010
Type: article
Pages: 192–199
Citations: 93 (OpenAlex)
Access: Closed

Cite This

Frank Lin, William W. Cohen (2010). Semi-Supervised Classification of Network Data Using Very Few Labels. 2010 International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 192–199. https://doi.org/10.1109/asonam.2010.19

Identifiers

DOI: 10.1109/asonam.2010.19