Abstract
Overfitting is a fundamental issue in supervised machine learning that prevents models from generalizing well: a model fits the observed training data closely but performs poorly on unseen test data. Overfitting arises from the presence of noise, the limited size of the training set, and the complexity of the classifier. This paper discusses overfitting from the perspectives of its causes and its solutions. To reduce the effects of overfitting, various strategies are proposed to address these causes: 1) the "early-stopping" strategy prevents overfitting by halting training before performance stops improving; 2) the "network-reduction" strategy excludes the noise in the training set; 3) the "data-expansion" strategy fine-tunes the hyper-parameter sets of complicated models with large amounts of data; and 4) the "regularization" strategy guarantees model performance to a great extent on real-world problems through feature selection, distinguishing more useful features from less useful ones.
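The "early-stopping" strategy (1) can be sketched as a patience rule on the validation loss: keep training while the loss improves, and halt once it has failed to improve for a fixed number of epochs. The function name, the patience value, and the precomputed loss curve below are illustrative assumptions, not from the paper:

```python
def early_stopping_train(val_losses, patience=3):
    """Return (stop_epoch, best_epoch) under a patience-based early-stopping rule.

    val_losses: per-epoch validation losses (precomputed here for the sketch;
    in practice each value would come from evaluating the model after an epoch).
    Training halts once the loss has not improved for `patience` consecutive epochs.
    """
    best_loss = float("inf")
    best_epoch = 0
    wait = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss:
            best_loss, best_epoch, wait = loss, epoch, 0
        else:
            wait += 1
            if wait >= patience:
                # Stop: no improvement for `patience` epochs; restore best_epoch weights.
                return epoch, best_epoch
    return len(val_losses) - 1, best_epoch

# Typical overfitting curve: validation loss falls, then rises again.
losses = [1.0, 0.7, 0.5, 0.45, 0.47, 0.50, 0.55, 0.60]
stop_epoch, best_epoch = early_stopping_train(losses)
# Training stops at epoch 6; the best checkpoint is epoch 3 (loss 0.45).
```

In practice the model weights from `best_epoch` are kept, so the returned model is the one taken just before the validation loss began to degrade.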
Publication Info
- Year: 2019
- Type: article
- Volume: 1168
- Pages: 022022
- Citations: 2055
- Access: Closed
Identifiers
- DOI: 10.1088/1742-6596/1168/2/022022