Abstract

Most modern convolutional neural networks (CNNs) used for object recognition are built using the same principles: Alternating convolution and max-pooling layers followed by a small number of fully connected layers. We re-evaluate the state of the art for object recognition from small images with convolutional networks, questioning the necessity of different components in the pipeline. We find that max-pooling can simply be replaced by a convolutional layer with increased stride without loss in accuracy on several image recognition benchmarks. Following this finding -- and building on other recent work for finding simple network structures -- we propose a new architecture that consists solely of convolutional layers and yields competitive or state of the art performance on several object recognition datasets (CIFAR-10, CIFAR-100, ImageNet). To analyze the network we introduce a new variant of the "deconvolution approach" for visualizing features learned by CNNs, which can be applied to a broader range of network structures than existing approaches.

Keywords

Convolutional neural networkPoolingComputer scienceConvolution (computer science)Pipeline (software)Artificial intelligenceDeconvolutionPattern recognition (psychology)Net (polyhedron)Object (grammar)Cognitive neuroscience of visual object recognitionContextual image classificationLayer (electronics)Image (mathematics)AlgorithmArtificial neural networkMathematics

Affiliated Institutions

Related Publications

Network In Network

Abstract: We propose a novel deep network structure called In Network (NIN) to enhance model discriminability for local patches within the receptive field. The conventional con...

2014 arXiv (Cornell University) 1037 citations

Fractional Max-Pooling

Convolutional networks almost always incorporate some form of spatial pooling, and very often it is alpha times alpha max-pooling with alpha=2. Max-pooling act on the hidden lay...

2014 arXiv (Cornell University) 335 citations

Publication Info

Year
2014
Type
preprint
Citations
2592
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2592
OpenAlex

Cite This

Jost Tobias Springenberg, Alexey Dosovitskiy, Thomas Brox et al. (2014). Striving for Simplicity: The All Convolutional Net. arXiv (Cornell University) . https://doi.org/10.48550/arxiv.1412.6806

Identifiers

DOI
10.48550/arxiv.1412.6806