Abstract

Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But pyramid representations have been avoided in recent object detectors that are based on deep convolutional networks, partially because they are slow to compute and memory intensive. In this paper, we exploit the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost. A top-down architecture with lateral connections is developed for building high-level semantic feature maps at all scales. This architecture, called a Feature Pyramid Network (FPN), shows significant improvement as a generic feature extractor in several applications. Using a basic Faster R-CNN system, our method achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles, surpassing all existing single-model entries including those from the COCO 2016 challenge winners. In addition, our method can run at 5 FPS on a GPU and thus is a practical and accurate solution to multi-scale object detection. Code will be made publicly available.

Keywords

Pyramid (geometry)Computer scienceFeature (linguistics)Object detectionBenchmark (surveying)Artificial intelligenceFeature extractionConvolutional neural networkPattern recognition (psychology)Construct (python library)ExploitSemantic featureObject (grammar)HierarchyMathematicsProgramming language

Affiliated Institutions

Related Publications

Non-local Neural Networks

Both convolutional and recurrent operations are building blocks that process one local neighborhood at a time. In this paper, we present non-local operations as a generic family...

2018 2018 IEEE/CVF Conference on Computer ... 10740 citations

Publication Info

Year
2017
Type
preprint
Pages
936-944
Citations
26836
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

26836
OpenAlex
3102
Influential
21347
CrossRef

Cite This

Tsung-Yi Lin, Piotr Dollár, Ross Girshick et al. (2017). Feature Pyramid Networks for Object Detection. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 936-944. https://doi.org/10.1109/cvpr.2017.106

Identifiers

DOI
10.1109/cvpr.2017.106
arXiv
1612.03144

Data Quality

Data completeness: 84%