Abstract

Recently, the channel attention mechanism has demonstrated great potential for improving the performance of deep convolutional neural networks (CNNs). However, most existing methods are dedicated to developing more sophisticated attention modules to achieve better performance, which inevitably increases model complexity. To overcome the paradox of the performance-complexity trade-off, this paper proposes an Efficient Channel Attention (ECA) module, which involves only a handful of parameters while bringing a clear performance gain. By dissecting the channel attention module in SENet, we empirically show that avoiding dimensionality reduction is important for learning channel attention, and that appropriate cross-channel interaction can preserve performance while significantly decreasing model complexity. Therefore, we propose a local cross-channel interaction strategy without dimensionality reduction, which can be efficiently implemented via 1D convolution. Furthermore, we develop a method to adaptively select the kernel size of the 1D convolution, which determines the coverage of local cross-channel interaction. The proposed ECA module is both efficient and effective; e.g., against a ResNet50 backbone, our module adds 80 parameters vs. 24.37M and 4.7e-4 GFLOPs vs. 3.86 GFLOPs, while boosting Top-1 accuracy by more than 2%. We extensively evaluate our ECA module on image classification, object detection, and instance segmentation with ResNet and MobileNetV2 backbones. The experimental results show our module is more efficient while performing favorably against its counterparts.
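To make the mechanism concrete, the sketch below is a minimal PyTorch-style rendering of the module as the abstract describes it: global average pooling, a 1D convolution across channels (no dimensionality reduction), and a sigmoid gate that rescales the input features. The specific adaptive kernel-size rule (deriving k from log2(C) with gamma=2 and b=1) and the class name ECALayer are illustrative assumptions, not taken verbatim from this abstract.

```python
import math
import torch
import torch.nn as nn

class ECALayer(nn.Module):
    """Channel attention via GAP -> 1D conv over channels -> sigmoid gate (sketch)."""

    def __init__(self, channels, gamma=2, b=1):
        super().__init__()
        # Assumed adaptive rule: kernel size is the nearest odd value to log2(C)/gamma + b/gamma.
        t = int(abs(math.log2(channels) / gamma + b / gamma))
        k = t if t % 2 else t + 1
        self.avg_pool = nn.AdaptiveAvgPool2d(1)
        self.conv = nn.Conv1d(1, 1, kernel_size=k, padding=k // 2, bias=False)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x):
        # x: (N, C, H, W)
        y = self.avg_pool(x)                     # (N, C, 1, 1) channel descriptor
        y = y.squeeze(-1).transpose(-1, -2)      # (N, 1, C): treat channels as a 1D sequence
        y = self.conv(y)                         # local cross-channel interaction, no dimensionality reduction
        y = self.sigmoid(y).transpose(-1, -2).unsqueeze(-1)  # back to (N, C, 1, 1)
        return x * y.expand_as(x)                # rescale input features channel-wise
```

Used this way, the module can be dropped after a convolutional block (e.g., `ECALayer(channels=256)` on a ResNet stage), adding only the k weights of the 1D convolution per block, which is consistent with the parameter counts quoted in the abstract.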

Keywords

FLOPS, Computer science, Convolutional neural network, Kernel (algebra), Convolution (computer science), Channel (broadcasting), Artificial intelligence, Computational complexity theory, Curse of dimensionality, Computation, Computer engineering, Reduction (mathematics), Performance improvement, Pattern recognition (psychology), Machine learning, Artificial neural network, Algorithm, Parallel computing, Telecommunications

Publication Info

Year: 2020
Type: Article
Pages: 11531-11539
Citations: 6942
Access: Closed

Citation Metrics

OpenAlex: 6942
Influential: 442
CrossRef: 6130

Cite This

Qilong Wang, Banggu Wu, Pengfei Zhu et al. (2020). ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 11531-11539. https://doi.org/10.1109/cvpr42600.2020.01155

Identifiers

DOI
10.1109/cvpr42600.2020.01155
arXiv
1910.03151
