EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Abstract

Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper, we systematically study model scaling and identify that carefully balancing network depth, width, and resolution can lead to better performance. Based on this observation, we propose a new scaling method that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient. We demonstrate the effectiveness of this method on scaling up MobileNets and ResNet. To go even further, we use neural architecture search to design a new baseline network and scale it up to obtain a family of models, called EfficientNets, which achieve much better accuracy and efficiency than previous ConvNets. In particular, our EfficientNet-B7 achieves state-of-the-art 84.3% top-1 accuracy on ImageNet, while being 8.4x smaller and 6.1x faster on inference than the best existing ConvNet. Our EfficientNets also transfer well and achieve state-of-the-art accuracy on CIFAR-100 (91.7%), Flowers (98.8%), and 3 other transfer learning datasets, with an order of magnitude fewer parameters. Source code is at https://github.com/tensorflow/tpu/tree/master/models/official/efficientnet.

Keywords

ScalingComputer scienceInferenceConvolutional neural networkCode (set theory)Transfer of learningScale (ratio)AlgorithmArtificial neural networkResolution (logic)Artificial intelligenceMachine learningPattern recognition (psychology)Mathematics

Affiliated Institutions

Google (United States) US

Related Publications

EfficientDet: Scalable and Efficient Object Detection

Mingxing Tan , Ruoming Pang , Quoc V. Le

Model efficiency has become increasingly important in computer vision. In this paper, we systematically study neural network architecture design choices for object detection and...

2020 2020 IEEE/CVF Conference on Computer ... 7436 citations

Going deeper with convolutions

Christian Szegedy , Wei Liu , Yangqing Jia +6 more

We propose a deep convolutional neural network architecture codenamed Inception that achieves the new state of the art for classification and detection in the ImageNet Large-Sca...

2015 45596 citations

Image Super-Resolution Using Very Deep Residual Channel Attention Networks

Yulun Zhang , Kunpeng Li , Kai Li +3 more

Convolutional neural network (CNN) depth is of crucial importance for image super-resolution (SR). However, we observe that deeper networks for image SR are more difficult to tr...

2018 Lecture notes in computer science 5131 citations

Image Super-Resolution Using Deep Convolutional Networks

Chao Dong , Chen Change Loy , Kaiming He +1 more

We propose a deep learning method for single image super-resolution (SR). Our method directly learns an end-to-end mapping between the low/high-resolution images. The mapping is...

2015 IEEE Transactions on Pattern Analysis... 9271 citations

SlowFast Networks for Video Recognition

Christoph Feichtenhofer , Haoqi Fan , Jitendra Malik +1 more

We present SlowFast networks for video recognition. Our model involves (i) a Slow pathway, operating at low frame rate, to capture spatial semantics, and (ii) a Fast pathway, op...

2019 3322 citations

Publication Info

Year: 2019
Type: preprint
Citations: 5008
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

5008

OpenAlex

Cite This

APA Style

                            
                                    Mingxing Tan, 
                                
                                    Quoc V. Le
                                
                            (2019). 
                            EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. 
                            arXiv (Cornell University)
                            
                            .
                            https://doi.org/10.48550/arxiv.1905.11946

Identifiers

DOI: 10.48550/arxiv.1905.11946

Data Quality

Data completeness: 77%