Abstract

Semantic image segmentation is a fundamental street-scene understanding task in autonomous driving, where each pixel of a high-resolution image is assigned one of a set of semantic labels. Unlike other scenarios, objects in autonomous driving scenes exhibit very large scale changes, which poses great challenges for high-level feature representation: multi-scale information must be encoded correctly. To remedy this problem, atrous convolution [14] was introduced to generate features with larger receptive fields without sacrificing spatial resolution. Built upon atrous convolution, Atrous Spatial Pyramid Pooling (ASPP) [2] was proposed to concatenate features produced by atrous convolutions with different dilation rates into a final feature representation. Although ASPP is able to generate multi-scale features, we argue that its feature resolution along the scale axis is not dense enough for the autonomous driving scenario. To this end, we propose Densely connected Atrous Spatial Pyramid Pooling (DenseASPP), which connects a set of atrous convolutional layers in a dense way, generating multi-scale features that not only cover a larger scale range but also cover that range densely, without significantly increasing the model size. We evaluate DenseASPP on the street-scene benchmark Cityscapes [4] and achieve state-of-the-art performance.
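The abstract's claim that dense connections cover the scale axis more densely follows from simple receptive-field arithmetic: a single k×k convolution with dilation d sees R = (k−1)·d + 1 pixels, and stacking two layers composes their fields as R = R₁ + R₂ − 1, so each dense connection between atrous layers creates a new effective scale. The sketch below illustrates this arithmetic; the helper names are illustrative and not taken from the paper.

```python
def receptive_field(kernel_size, dilation):
    """Receptive field of one dilated (atrous) convolution:
    R = (kernel_size - 1) * dilation + 1."""
    return (kernel_size - 1) * dilation + 1

def stacked_receptive_field(fields):
    """Composing stacked layers: R = R1 + R2 - 1, applied pairwise."""
    total = 1
    for r in fields:
        total += r - 1
    return total

# A lone 3x3 conv with dilation 6 sees 13 pixels.
r6 = receptive_field(3, 6)   # 13

# A dense connection feeding the d=6 output into a d=12 layer
# yields a new, intermediate scale that neither rate alone provides.
r6_12 = stacked_receptive_field([receptive_field(3, 6),
                                 receptive_field(3, 12)])  # 13 + 25 - 1 = 37
```

With more layers densely connected, every subset of dilation rates along a path contributes its own combined receptive field, which is the sense in which DenseASPP samples the scale range more densely than a parallel ASPP of the same rates.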

Keywords

Artificial intelligence, Computer science, Pyramid (geometry), Computer vision, Segmentation, Pooling, Pattern recognition (psychology), Feature (linguistics), Scale (ratio), Convolution (computer science), Image resolution, Feature extraction, Mathematics, Geography, Cartography, Artificial neural network, Geometry

Publication Info

Year
2018
Type
article
Citations
1566
Access
Closed


Citation Metrics

OpenAlex: 1566
Influential: 130
CrossRef: 1207

Cite This

Maoke Yang, Kun Yu, Chi Zhang et al. (2018). DenseASPP for Semantic Segmentation in Street Scenes. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. https://doi.org/10.1109/cvpr.2018.00388

Identifiers

DOI
10.1109/cvpr.2018.00388
