Abstract

Semantic segmentation and object detection research have recently achieved rapid progress. However, the former task has no notion of different instances of the same object, and the latter operates at a coarse, bounding-box level. We propose an Instance Segmentation system that produces a segmentation map where each pixel is assigned an object class and instance identity label. Most approaches adapt object detectors to produce segments instead of boxes. In contrast, our method is based on an initial semantic segmentation module, which feeds into an instance subnetwork. This subnetwork uses the initial category-level segmentation, along with cues from the output of an object detector, within an end-to-end CRF to predict instances. This part of our model is dynamically instantiated to produce a variable number of instances per image. Our end-to-end approach requires no post-processing and considers the image holistically, instead of processing independent proposals. Therefore, unlike some related work, a pixel cannot belong to multiple instances. Furthermore, far more precise segmentations are achieved, as shown by our substantial improvements at high APr thresholds.

Keywords

SubnetworkSegmentationComputer scienceArtificial intelligenceMinimum bounding boxObject (grammar)PixelObject detectionImage segmentationComputer visionSegmentation-based object categorizationPattern recognition (psychology)Scale-space segmentationBounding overwatchTask (project management)Class (philosophy)Image (mathematics)

Affiliated Institutions

Related Publications

Publication Info

Year
2017
Type
article
Pages
879-888
Citations
241
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

241
OpenAlex

Cite This

Anurag Arnab, Philip H. S. Torr (2017). Pixelwise Instance Segmentation with a Dynamically Instantiated Network. , 879-888. https://doi.org/10.1109/cvpr.2017.100

Identifiers

DOI
10.1109/cvpr.2017.100