Convolutional feature masking for joint object and stuff segmentation

Abstract

The topic of semantic segmentation has witnessed considerable progress due to the powerful features learned by convolutional neural networks (CNNs). The current leading approaches for semantic segmentation exploit shape information by extracting CNN features from masked image regions. This strategy introduces artificial boundaries on the images and may impact the quality of the extracted features. Besides, the operations on the raw image domain require to compute thousands of networks on a single image, which is time-consuming. In this paper, we propose to exploit shape information via masking convolutional features. The proposal segments (e.g., super-pixels) are treated as masks on the convolutional feature maps. The CNN features of segments are directly masked out from these maps and used to train classifiers for recognition. We further propose a joint method to handle objects and "stuff" (e.g., grass, sky, water) in the same framework. State-of-the-art results are demonstrated on benchmarks of PASCAL VOC and new PASCAL-CONTEXT, with a compelling computational speed.

Keywords

Computer sciencePascal (unit)Artificial intelligenceConvolutional neural networkSegmentationExploitPattern recognition (psychology)Masking (illustration)Feature (linguistics)Computer visionImage segmentationPixelFeature extractionObject detection

Affiliated Institutions

Microsoft (United States) US

Related Publications

Object Detection via a Multi-region and Semantic Segmentation-Aware CNN Model

Spyros Gidaris , Nikos Komodakis

We propose an object detection system that relies on a multi-region deep convolutional neural network (CNN) that also encodes semantic segmentation-aware features. The resulting...

2015 799 citations

Scene Segmentation with DAG-Recurrent Neural Networks

Bing Shuai , Zhen Zuo , Bing Wang +1 more

In this paper, we address the challenging task of scene segmentation. In order to capture the rich contextual dependencies over image regions, we propose Directed Acyclic Graph-...

2017 IEEE Transactions on Pattern Analysis... 143 citations

Dual Attention Network for Scene Segmentation

Jun Fu , Jing Liu , Haijie Tian +4 more

In this paper, we address the scene segmentation task by capturing rich contextual dependencies based on the self-attention mechanism. Unlike previous works that capture context...

2019 6497 citations

HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection

Tao Kong , Anbang Yao , Yurong Chen +1 more

Almost all of the current top-performing object detection networks employ region proposals to guide the search for object instances. State-of-the-art region proposal methods usu...

2016 833 citations

Residual Conv-Deconv Grid Network for Semantic Segmentation

Damien Fourure , Rémi Emonet , Élisa Fromont +3 more

This paper presents GridNet, a new Convolutional Neural Network (CNN)\narchitecture for semantic image segmentation (full scene labelling). Classical\nneural networks are implem...

2017 211 citations

Publication Info

Year: 2015
Type: preprint
Citations: 466
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Convolutional feature masking for joint object and stuff segmentation

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

466

OpenAlex

Cite This

APA Style

                            
                                    Jifeng Dai, 
                                
                                    Kaiming He, 
                                
                                    Jian Sun
                                
                            (2015). 
                            Convolutional feature masking for joint object and stuff segmentation. 
                            
                            .
                            https://doi.org/10.1109/cvpr.2015.7299025

Identifiers

DOI: 10.1109/cvpr.2015.7299025