A survey on Image Data Augmentation for Deep Learning

Abstract

Deep convolutional neural networks have performed remarkably well on many Computer Vision tasks. However, these networks rely heavily on big data to avoid overfitting. Overfitting occurs when a network learns a function with very high variance that perfectly models the training data. Unfortunately, many application domains, such as medical image analysis, do not have access to big data. This survey focuses on Data Augmentation, a data-space solution to the problem of limited data. Data Augmentation encompasses a suite of techniques that enhance the size and quality of training datasets so that better Deep Learning models can be built from them. The image augmentation algorithms discussed in this survey include geometric transformations, color space augmentations, kernel filters, mixing images, random erasing, feature space augmentation, adversarial training, generative adversarial networks, neural style transfer, and meta-learning. The application of augmentation methods based on GANs is heavily covered in this survey. In addition to augmentation techniques, this paper briefly discusses other characteristics of Data Augmentation such as test-time augmentation, resolution impact, final dataset size, and curriculum learning. This survey presents existing methods for Data Augmentation, promising developments, and meta-level decisions for implementing Data Augmentation. Readers will understand how Data Augmentation can improve the performance of their models and expand limited datasets to take advantage of the capabilities of big data.
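As a rough illustration of three of the techniques named in the abstract (a geometric transformation, random erasing, and mixing images), the sketch below operates on a grayscale image stored as a nested list of floats. It is not taken from the survey: the function names, the patch size, the fill value, and the 0.5 mixing weight are all assumptions chosen for illustration, and plain Python lists are used only to keep the example dependency-free.

```python
import random

def horizontal_flip(image):
    """Geometric transformation: mirror each row left to right."""
    return [row[::-1] for row in image]

def random_erase(image, patch_h, patch_w, fill=0.0, rng=random):
    """Random erasing: overwrite one randomly placed patch with a constant.

    patch_h, patch_w and fill are illustrative parameters (assumptions).
    """
    height, width = len(image), len(image[0])
    top = rng.randint(0, height - patch_h)    # randint is inclusive at both ends
    left = rng.randint(0, width - patch_w)
    erased = [row[:] for row in image]        # copy so the input is not modified
    for r in range(top, top + patch_h):
        for c in range(left, left + patch_w):
            erased[r][c] = fill
    return erased

def mix_images(image_a, image_b, weight=0.5):
    """Mixing images: pixel-wise weighted average of two same-sized images."""
    return [
        [weight * a + (1.0 - weight) * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(image_a, image_b)
    ]

def augment(image, other, rng=random):
    """Return three augmented variants of `image` (hypothetical helper)."""
    return [
        horizontal_flip(image),
        random_erase(image, 2, 2, rng=rng),
        mix_images(image, other),
    ]
```

Each call to `augment` turns one training image into three additional samples, which is the sense in which augmentation expands a limited dataset. In practice an array library would replace the nested lists.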

Keywords

Computer science, Overfitting, Deep learning, Artificial intelligence, Machine learning, Big data, Artificial neural network, Transfer of learning, Convolutional neural network, Benchmark (surveying), Data science, Data mining

Affiliated Institutions

Florida Atlantic University US

Related Publications

Generative Adversarial Networks: An Overview

Antonia Creswell, Tom White, Vincent Dumoulin +3 more

Generative adversarial networks (GANs) provide a way to learn deep representations without extensively annotated training data. They achieve this by deriving backpropagation sig...

2018 IEEE Signal Processing Magazine 4073 citations

Fractional Max-Pooling

Benjamin Graham

Convolutional networks almost always incorporate some form of spatial pooling, and very often it is alpha times alpha max-pooling with alpha=2. Max-pooling acts on the hidden lay...

2014 arXiv (Cornell University) 335 citations

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. T...

2024 arXiv (Cornell University) 15635 citations

Network In Network

Min Lin, Qiang Chen, Shuicheng Yan

We propose a novel deep network structure called Network In Network (NIN) to enhance model discriminability for local patches within the receptive field. The conventional con...

2014 arXiv (Cornell University) 1037 citations

CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features

Sangdoo Yun, Dongyoon Han, Sanghyuk Chun +3 more

Regional dropout strategies have been proposed to enhance performance of convolutional neural network classifiers. They have proved to be effective for guiding the model to atte...

2019 4293 citations

Publication Info

Year: 2019
Type: article
Volume: 6
Issue: 1
Citations: 11041
Access: Closed

Citation Metrics

OpenAlex citations: 11041
Influential citations: 251

Cite This

APA Style

                            
Connor Shorten, Taghi M. Khoshgoftaar (2019). A survey on Image Data Augmentation for Deep Learning. Journal of Big Data, 6(1). https://doi.org/10.1186/s40537-019-0197-0

Identifiers

DOI: 10.1186/s40537-019-0197-0
