Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network

Abstract

Recently, several models based on deep neural networks have achieved great success in terms of both reconstruction accuracy and computational performance for single image super-resolution. In these methods, the low resolution (LR) input image is upscaled to the high resolution (HR) space using a single filter, commonly bicubic interpolation, before reconstruction. This means that the super-resolution (SR) operation is performed in HR space. We demonstrate that this is sub-optimal and adds computational complexity. In this paper, we present the first convolutional neural network (CNN) capable of real-time SR of 1080p videos on a single K2 GPU. To achieve this, we propose a novel CNN architecture where the feature maps are extracted in the LR space. In addition, we introduce an efficient sub-pixel convolution layer which learns an array of upscaling filters to upscale the final LR feature maps into the HR output. By doing so, we effectively replace the handcrafted bicubic filter in the SR pipeline with more complex upscaling filters specifically trained for each feature map, whilst also reducing the computational complexity of the overall SR operation. We evaluate the proposed approach using images and videos from publicly available datasets and show that it performs significantly better (+0.15dB on Images and +0.39dB on Videos) and is an order of magnitude faster than previous CNN-based methods.

Keywords

Computer scienceBicubic interpolationConvolutional neural networkArtificial intelligenceFeature (linguistics)Pipeline (software)Convolution (computer science)Computational complexity theoryInterpolation (computer graphics)Computer visionPixelImage resolutionDeep learningUpsamplingPattern recognition (psychology)Filter (signal processing)Artificial neural networkImage (mathematics)AlgorithmLinear interpolation

Related Publications

Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding

Marco Bevilacqua , Aline Roumy , Christine Guillemot

This paper describes a single-image super-resolution (SR) algorithm based on nonnegative neighbor embedding. It belongs to the family of single-image example-based SR algorithms...

2012 2534 citations

Learning a Single Convolutional Super-Resolution Network for Multiple Degradations

Kai Zhang , Wangmeng Zuo , Lei Zhang

Recent years have witnessed the unprecedented success of deep convolutional neural networks (CNNs) in single image super-resolution (SISR). However, existing CNN-based SISR meth...

2018 1089 citations

Space-time super-resolution from a single video

Oded Shahar , Alon Faktor , Michal Irani

Spatial Super Resolution (SR) aims to recover fine image details, smaller than a pixel size. Temporal SR aims to recover rapid dynamic events that occur faster than the video fr...

2011 121 citations

Residual Dense Network for Image Super-Resolution

Yulun Zhang , Yapeng Tian , Yu Kong +2 more

A very deep convolutional neural network (CNN) has recently achieved great success for image super-resolution (SR) and offered hierarchical features as well. However, most deep ...

2018 3866 citations

Image Super-Resolution With Sparse Neighbor Embedding

Xinbo Gao , Kaibing Zhang , Dacheng Tao +1 more

Until now, neighbor-embedding-based (NE) algorithms for super-resolution (SR) have carried out two independent processes to synthesize high-resolution (HR) image patches. In the...

2012 IEEE Transactions on Image Processing 312 citations

Publication Info

Year: 2016
Type: article
Pages: 1874-1883
Citations: 6731
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

6731

OpenAlex

Cite This

APA Style

                            
                                    Wenzhe Shi, 
                                
                                    José Caballero, 
                                
                                    Ferenc Huszár
                                
                                et al.
                            
                            (2016). 
                            Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. 
                            
                            , 1874-1883.
                            https://doi.org/10.1109/cvpr.2016.207

Identifiers

DOI: 10.1109/cvpr.2016.207