Abstract

Recently, Neural Architecture Search (NAS) has successfully identified neural network architectures that exceed human designed ones on large-scale image classification. In this paper, we study NAS for semantic image segmentation. Existing works often focus on searching the repeatable cell structure, while hand-designing the outer network structure that controls the spatial resolution changes. This choice simplifies the search space, but becomes increasingly problematic for dense image prediction which exhibits a lot more network level architectural variations. Therefore, we propose to search the network level structure in addition to the cell level structure, which forms a hierarchical architecture search space. We present a network level search space that includes many popular designs, and develop a formulation that allows efficient gradient-based architecture search (3 P100 GPU days on Cityscapes images). We demonstrate the effectiveness of the proposed method on the challenging Cityscapes, PASCAL VOC 2012, and ADE20K datasets. Auto-DeepLab, our architecture searched specifically for semantic image segmentation, attains state-of-the-art performance without any ImageNet pretraining.

Keywords

Computer sciencePascal (unit)Artificial intelligenceSegmentationArtificial neural networkArchitectureImage segmentationImage (mathematics)Network architecturePattern recognition (psychology)Machine learning

Affiliated Institutions

Related Publications

Publication Info

Year
2019
Type
article
Pages
82-92
Citations
1043
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1043
OpenAlex

Cite This

Chenxi Liu, Liang-Chieh Chen, Florian Schroff et al. (2019). Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation. , 82-92. https://doi.org/10.1109/cvpr.2019.00017

Identifiers

DOI
10.1109/cvpr.2019.00017