Abstract

The goal of this paper is to serve as a guide for selecting a detection architecture that achieves the right speed/memory/accuracy balance for a given application and platform. To this end, we investigate various ways to trade accuracy for speed and memory usage in modern convolutional object detection systems. A number of successful systems have been proposed in recent years, but apples-toapples comparisons are difficult due to different base feature extractors (e.g., VGG, Residual Networks), different default image resolutions, as well as different hardware and software platforms. We present a unified implementation of the Faster R-CNN [30], R-FCN [6] and SSD [25] systems, which we view as meta-architectures and trace out the speed/accuracy trade-off curve created by using alternative feature extractors and varying other critical parameters such as image size within each of these meta-architectures. On one extreme end of this spectrum where speed and memory are critical, we present a detector that achieves real time speeds and can be deployed on a mobile device. On the opposite end in which accuracy is critical, we present a detector that achieves state-of-the-art performance measured on the COCO detection task.

Keywords

Computer scienceDetectorObject detectionResidualFeature (linguistics)Convolutional neural networkSpeedupTRACE (psycholinguistics)SoftwareArtificial intelligenceReal-time computingPattern recognition (psychology)Parallel computingAlgorithm

Affiliated Institutions

Related Publications

Publication Info

Year
2017
Type
preprint
Pages
3296-3297
Citations
2604
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2604
OpenAlex
263
Influential
1702
CrossRef

Cite This

Jonathan Huang, Vivek Rathod, Chen Sun et al. (2017). Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , 3296-3297. https://doi.org/10.1109/cvpr.2017.351

Identifiers

DOI
10.1109/cvpr.2017.351
arXiv
1611.10012

Data Quality

Data completeness: 84%