YOLO9000: Better, Faster, Stronger | RDL Research Database

Abstract

We introduce YOLO9000, a state-of-the-art, real-time object detection system that can detect over 9000 object categories. First we propose various improvements to the YOLO detection method, both novel and drawn from prior work. The improved model, YOLOv2, is state-of-the-art on standard detection tasks like PASCAL VOC and COCO. Using a novel, multi-scale training method the same YOLOv2 model can run at varying sizes, offering an easy tradeoff between speed and accuracy. At 67 FPS, YOLOv2 gets 76.8 mAP on VOC 2007. At 40 FPS, YOLOv2 gets 78.6 mAP, outperforming state-of-the-art methods like Faster RCNN with ResNet and SSD while still running significantly faster. Finally we propose a method to jointly train on object detection and classification. Using this method we train YOLO9000 simultaneously on the COCO detection dataset and the ImageNet classification dataset. Our joint training allows YOLO9000 to predict detections for object classes that dont have labelled detection data. We validate our approach on the ImageNet detection task. YOLO9000 gets 19.7 mAP on the ImageNet detection validation set despite only having detection data for 44 of the 200 classes. On the 156 classes not in COCO, YOLO9000 gets 16.0 mAP. YOLO9000 predicts detections for more than 9000 different object categories, all in real-time.

Keywords

Object detectionPascal (unit)Computer scienceArtificial intelligencePattern recognition (psychology)Object (grammar)Training setComputer vision

Affiliated Institutions

Related Publications

Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks

Maxime Oquab , Léon Bottou , Ivan Laptev +1 more

Convolutional neural networks (CNN) have recently shown outstanding image classification performance in the large- scale visual recognition challenge (ILSVRC2012). The success o...

2014 3151 citations

YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors

Chien-Yao Wang , Alexey Bochkovskiy , Hong-Yuan Mark Liao

Real-time object detection is one of the most important research topics in computer vision. As new approaches regarding architecture optimization and training optimization are c...

2023 2023 IEEE/CVF Conference on Computer ... 9475 citations

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick , Jeff Donahue , Trevor Darrell +1 more

Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that...

2014 30615 citations

CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features

Sangdoo Yun , Dongyoon Han , Sanghyuk Chun +3 more

Regional dropout strategies have been proposed to enhance performance of convolutional neural network classifiers. They have proved to be effective for guiding the model to atte...

2019 4293 citations

Is object localization for free? - Weakly-supervised learning with convolutional neural networks

Maxime Oquab , Léon Bottou , Ivan Laptev +1 more

Successful methods for visual object recognition typically rely on training datasets containing lots of richly annotated images. Detailed image annotation, e.g. by object boundi...

2015 915 citations

Publication Info

Year: 2017
Type: article
Pages: 6517-6525
Citations: 18283
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

YOLO9000: Better, Faster, Stronger

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

18283

OpenAlex

Cite This

APA Style

                            
                                    Joseph Redmon, 
                                
                                    Ali Farhadi
                                
                            (2017). 
                            YOLO9000: Better, Faster, Stronger. 
                            
                            , 6517-6525.
                            https://doi.org/10.1109/cvpr.2017.690

Identifiers

DOI: 10.1109/cvpr.2017.690