Abstract

Neural networks have enabled state-of-the-art approaches to achieve incredible results on computer vision tasks such as object detection. However, such success greatly relies on costly computation resources, which hinders people with cheap devices from appreciating the advanced technology. In this paper, we propose Cross Stage Partial Network (CSPNet) to mitigate the problem that previous works require heavy inference computations from the network architecture perspective. We attribute the problem to the duplicate gradient information within network optimization. The proposed networks respect the variability of the gradients by integrating feature maps from the beginning and the end of a network stage, which, in our experiments, reduces computations by 20% with equivalent or even superior accuracy on the ImageNet dataset, and significantly outperforms state-of-the-art approaches in terms of AP50 on the MS COCO object detection dataset. The CSPNet is easy to implement and general enough to cope with architectures based on ResNet, ResNeXt, and DenseNet.
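The idea of integrating feature maps from the beginning and the end of a stage can be illustrated with a minimal sketch. It assumes a channel-wise split in which one part bypasses the stage and the other passes through it, followed by concatenation; the abstract itself does not spell out these details. The names `toy_stage`, `cross_stage_partial`, and `split_ratio` are hypothetical, and the toy stage stands in for real convolutional blocks.

```python
from typing import Callable, List

Channel = List[float]        # one flattened feature-map channel
FeatureMap = List[Channel]   # a feature map as a list of channels


def toy_stage(channels: FeatureMap) -> FeatureMap:
    """Hypothetical stand-in for a stage's blocks: ReLU(2*x - 1) per element."""
    return [[max(0.0, 2.0 * v - 1.0) for v in ch] for ch in channels]


def cross_stage_partial(
    x: FeatureMap,
    stage: Callable[[FeatureMap], FeatureMap],
    split_ratio: float = 0.5,  # assumed fraction of channels that skip the stage
) -> FeatureMap:
    """Route only part of the channels through `stage`, then merge with the rest."""
    n_bypass = int(len(x) * split_ratio)
    bypass, processed_in = x[:n_bypass], x[n_bypass:]
    processed_out = stage(processed_in)
    # Merge features from the beginning of the stage (bypass) with
    # features from its end (processed_out) by channel concatenation.
    return bypass + processed_out


x = [[0.0, 1.0], [2.0, 3.0], [1.0, 0.25], [4.0, 0.5]]
y = cross_stage_partial(x, toy_stage)
print(y)  # [[0.0, 1.0], [2.0, 3.0], [1.0, 0.0], [7.0, 0.0]]
```

In this sketch only half of the channels reach the stage, so the stage's work shrinks accordingly while the output keeps the original channel count. The 20% reduction reported in the abstract comes from the authors' experiments, not from this toy example.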

Keywords

Computer science; Inference; Computation; Artificial intelligence; Feature (linguistics); Backbone network; Object detection; Object (grammar); Feature engineering; Machine learning; Perspective (graphical); State (computer science); Architecture; Deep learning; Data mining; Pattern recognition (psychology); Algorithm; Computer network

Publication Info

Year: 2020
Type: article
Citations: 4309
Access: Closed

Citation Metrics

4309 citations (OpenAlex)

Cite This

Chien-Yao Wang, Hong-Yuan Mark Liao, Yueh-Hua Wu et al. (2020). CSPNet: A New Backbone that can Enhance Learning Capability of CNN. https://doi.org/10.1109/cvprw50498.2020.00203

Identifiers

DOI: 10.1109/cvprw50498.2020.00203