Fully Convolutional Networks for Semantic Segmentation

Abstract

Convolutional networks are powerful visual models that yield hierarchies of features. We show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, improve on the previous best result in semantic segmentation. Our key insight is to build "fully convolutional" networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning. We define and detail the space of fully convolutional networks, explain their application to spatially dense prediction tasks, and draw connections to prior models. We adapt contemporary classification networks (AlexNet, the VGG net, and GoogLeNet) into fully convolutional networks and transfer their learned representations by fine-tuning to the segmentation task. We then define a skip architecture that combines semantic information from a deep, coarse layer with appearance information from a shallow, fine layer to produce accurate and detailed segmentations. Our fully convolutional networks achieve improved segmentation of PASCAL VOC (30% relative improvement to 67.2% mean IU on 2012), NYUDv2, SIFT Flow, and PASCAL-Context, while inference takes one tenth of a second for a typical image.
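
The abstract reports results as mean IU (mean intersection over union). As a minimal sketch of how that metric can be computed for a single predicted label map, assuming integer class labels stored in nested lists, the following may help; the function name `mean_iu`, the toy data, and the choice to skip classes absent from both maps are illustrative assumptions, not the authors' evaluation code.

```python
def mean_iu(pred, truth, num_classes):
    """Average per-class intersection over union for two label maps."""
    # confusion[i][j] counts pixels whose true class is i and predicted class is j
    confusion = [[0] * num_classes for _ in range(num_classes)]
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            confusion[t][p] += 1

    ious = []
    for c in range(num_classes):
        intersection = confusion[c][c]
        true_total = sum(confusion[c])
        pred_total = sum(confusion[r][c] for r in range(num_classes))
        union = true_total + pred_total - intersection
        if union > 0:  # assumption: ignore classes absent from both maps
            ious.append(intersection / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 2x3 label maps with two classes (hypothetical data)
truth = [[0, 0, 1],
         [0, 1, 1]]
pred = [[0, 1, 1],
        [0, 1, 1]]
score = mean_iu(pred, truth, num_classes=2)
print(round(score, 4))  # 0.7083
```

Here class 0 has IU 2/3 and class 1 has IU 3/4, so the mean is about 0.7083. Benchmark evaluations typically accumulate the confusion counts over a whole dataset before dividing, rather than averaging per-image scores.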

Keywords

Computer science, Artificial intelligence, Segmentation, Convolutional neural network, Pascal (unit), Pattern recognition (psychology), Inference, Pixel, Deep learning

Affiliated Institutions

University of California, Berkeley (US)

Related Publications

Fully convolutional networks for semantic segmentation

Jonathan Long, Evan Shelhamer, Trevor Darrell

Convolutional networks are powerful visual models that yield hierarchies of features. We show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, ex...

2015 35498 citations

Learning Deconvolution Network for Semantic Segmentation

Hyeonwoo Noh, Seunghoon Hong, Bohyung Han

We propose a novel semantic segmentation algorithm by learning a deep deconvolution network. We learn the network on top of the convolutional layers adopted from VGG 16-layer ne...

2015 3978 citations

Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation

Guosheng Lin, Chunhua Shen, Anton van den Hengel +1 more

Recent advances in semantic image segmentation have mostly been achieved by training deep convolutional neural networks (CNNs). We show how to improve semantic segmentation thro...

2016 844 citations

Instance-Aware Semantic Segmentation via Multi-task Network Cascades

Jifeng Dai, Kaiming He, Jian Sun

Semantic segmentation research has recently witnessed rapid progress, but many leading methods are unable to identify object instances. In this paper, we present Multitask Netwo...

2016 1264 citations

RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation

Guosheng Lin, Anton Milan, Chunhua Shen +1 more

Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and have also been the first choice for dense classification pr...

2017 3120 citations

Publication Info

Year: 2016
Type: article
Volume: 39
Issue: 4
Pages: 640-651
Citations: 10715
Access: Closed

Cite This

APA Style

                            
Evan Shelhamer, Jonathan Long, Trevor Darrell (2016). Fully Convolutional Networks for Semantic Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(4), 640-651. https://doi.org/10.1109/tpami.2016.2572683

Identifiers

DOI: 10.1109/tpami.2016.2572683