Abstract

The ability to learn richer network representations generally boosts the performance of deep learning models. To improve representation learning in convolutional neural networks, we present a multi-branch architecture, which applies channel-wise attention across different network branches to leverage the complementary strengths of both feature-map attention and multi-path representation. Our proposed Split-Attention module provides a simple and modular computation block that can serve as a drop-in replacement for the popular residual block, while producing more diverse representations via cross-feature interactions. Adding a Split-Attention module into the architecture design space of RegNet-Y and FBNetV2 directly improves the performance of the resulting networks. Replacing residual blocks with our Split-Attention module, we further design a new variant of the ResNet model, named ResNeSt, which outperforms EfficientNet in terms of the accuracy/latency trade-off.
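
To illustrate the channel-wise attention across branches described in the abstract, the following PyTorch-style sketch implements a simplified, radix-only Split-Attention operation. The class name, hyperparameters, and layer choices here are illustrative assumptions for exposition, not the authors' reference implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SplitAttention(nn.Module):
    """Minimal sketch of a radix-wise split-attention operation (illustrative only)."""
    def __init__(self, channels, radix=2, reduction=4):
        super().__init__()
        self.radix = radix
        inter = max(channels * radix // reduction, 32)
        # One grouped conv produces `radix` feature-map splits at once.
        self.conv = nn.Conv2d(channels, channels * radix, kernel_size=3,
                              padding=1, groups=radix, bias=False)
        self.bn = nn.BatchNorm2d(channels * radix)
        # Small bottleneck that maps pooled features to per-split attention logits.
        self.fc1 = nn.Conv2d(channels, inter, kernel_size=1)
        self.fc2 = nn.Conv2d(inter, channels * radix, kernel_size=1)

    def forward(self, x):
        b, c = x.shape[0], x.shape[1]
        splits = F.relu(self.bn(self.conv(x)))                   # (B, C*radix, H, W)
        splits = splits.view(b, self.radix, c, *splits.shape[2:])
        gap = splits.sum(dim=1).mean(dim=(2, 3), keepdim=True)   # fuse splits, global pool -> (B, C, 1, 1)
        attn = self.fc2(F.relu(self.fc1(gap)))                   # per-split logits, (B, C*radix, 1, 1)
        attn = attn.view(b, self.radix, c, 1, 1)
        attn = F.softmax(attn, dim=1)                            # softmax across the radix (split) dimension
        return (attn * splits).sum(dim=1)                        # attention-weighted sum of splits

# Example usage on a feature map:
# y = SplitAttention(64)(torch.randn(2, 64, 32, 32))  # -> shape (2, 64, 32, 32)
```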

Keywords

Computer science, Leverage (statistics), Residual, Modular design, Feature learning, Artificial intelligence, Convolutional neural network, Block (permutation group theory), Representation (politics), Architecture, Computation, Latency (audio), Deep learning, Residual neural network, Theoretical computer science, Computer engineering, Algorithm

Publication Info

Year: 2022
Type: Article
Pages: 2735-2745
Citations: 1177
Access: Closed

Citation Metrics

Citations (OpenAlex): 1177

Cite This

Hang Zhang, Chongruo Wu, Zhongyue Zhang et al. (2022). ResNeSt: Split-Attention Networks. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2735-2745. https://doi.org/10.1109/cvprw56347.2022.00309

Identifiers

DOI
10.1109/cvprw56347.2022.00309