Abstract

Convolutional neural networks are built upon the convolution operation, which extracts informative features by fusing spatial and channel-wise information together within local receptive fields. In order to boost the representational power of a network, several recent approaches have shown the benefit of enhancing spatial encoding. In this work, we focus on the channel relationship and propose a novel architectural unit, which we term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. We demonstrate that by stacking these blocks together, we can construct SENet architectures that generalise extremely well across challenging datasets. Crucially, we find that SE blocks produce significant performance improvements for existing state-of-the-art deep architectures at minimal additional computational cost. SENets formed the foundation of our ILSVRC 2017 classification submission which won first place and significantly reduced the top-5 error to 2.251%, achieving a ~25% relative improvement over the winning entry of 2016. Code and models are available at https://github.com/hujie-frank/SENet.

Keywords

Computer scienceConvolution (computer science)Block (permutation group theory)Convolutional neural networkCode (set theory)Encoding (memory)Focus (optics)Construct (python library)Channel (broadcasting)Feature (linguistics)Decoding methodsArtificial intelligenceTheoretical computer sciencePattern recognition (psychology)Computer engineeringAlgorithmArtificial neural networkProgramming languageTelecommunications

Affiliated Institutions

Related Publications

Squeeze-and-Excitation Networks

The central building block of convolutional neural networks (CNNs) is the convolution operator, which enables networks to construct informative features by fusing both spatial a...

2019 IEEE Transactions on Pattern Analysis... 12023 citations

Publication Info

Year
2018
Type
preprint
Pages
7132-7141
Citations
25361
Access
Closed

Social Impact

Altmetric

Social media, news, blog, policy document mentions

Citation Metrics

25361
OpenAlex
2123
Influential
23961
CrossRef

Cite This

Jie Hu, Li Shen, Gang Sun (2018). Squeeze-and-Excitation Networks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition , 7132-7141. https://doi.org/10.1109/cvpr.2018.00745

Identifiers

DOI
10.1109/cvpr.2018.00745
arXiv
1709.01507

Data Quality

Data completeness: 88%