From Virtual to Reality: Fast Adaptation of Virtual Object Detectors to Real Domains

Abstract

The most successful 2D object detection methods require a large number of images annotated with object bounding boxes to be collected for training. We present an alternative approach that trains on virtual data rendered from 3D models, avoiding the need for manual labeling. Growing demand for virtual reality applications is quickly bringing about an abundance of available 3D models for a large variety of object categories. While mainstream use of 3D models in vision has focused on predicting the 3D pose of objects, we investigate the use of such freely available 3D models for multicategory 2D object detection. To address the issue of dataset bias that arises from training on virtual data and testing on real images, we propose a simple and fast adaptation approach based on decorrelated features. We also compare two kinds of virtual data, one rendered with real-image textures and one without. Evaluation on a benchmark domain adaptation dataset demonstrates that our method performs comparably to existing methods trained on large-scale real image domains.
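The abstract does not say how the features are decorrelated, so the following is only a generic illustration of the idea and not the authors' procedure. It is a minimal sketch that whitens feature vectors with the mean and covariance of a reference set, for example unlabeled real-image features. The function name `decorrelate`, the ridge term `eps`, and the synthetic data are all assumptions made for the example.

```python
import numpy as np


def decorrelate(features, reference, eps=1e-3):
    """Whiten `features` using the mean and covariance of `reference`.

    Both arguments are (n_samples, n_dims) arrays. `eps` is a hypothetical
    ridge term that keeps the covariance matrix invertible.
    """
    mean = reference.mean(axis=0)
    cov = np.cov(reference, rowvar=False) + eps * np.eye(reference.shape[1])
    # Symmetric inverse square root of the covariance via eigendecomposition.
    eigvals, eigvecs = np.linalg.eigh(cov)
    whitener = eigvecs @ np.diag(eigvals ** -0.5) @ eigvecs.T
    return (features - mean) @ whitener


rng = np.random.default_rng(0)
# Synthetic stand-in for real-image features with correlated dimensions.
mixing = np.array([[2.0, 0.0, 0.0], [1.5, 1.0, 0.0], [0.5, 0.3, 0.2]])
real_features = rng.normal(size=(5000, 3)) @ mixing.T

whitened = decorrelate(real_features, real_features, eps=1e-6)
whitened_cov = np.cov(whitened, rowvar=False)  # close to the identity matrix
```

In a virtual-to-real setting, one could pass features from rendered images as `features` and real-image statistics as `reference`, so that a detector trained on the result is less tied to correlations specific to the virtual domain. Whether this matches the paper's exact formulation is not stated in the abstract.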

Keywords

Computer science, Object detection, Benchmark (surveying), Virtual reality, Artificial intelligence, Adaptation (eye), Object (grammar), Computer vision, Minimum bounding box, Domain (mathematical analysis), Bounding overwatch, Virtual image, Machine learning, Image (mathematics), Pattern recognition (psychology)

Affiliated Institutions

University of Massachusetts Lowell US

Publication Info

Year: 2014
Type: article
Pages: 82.1-82.12
Citations: 167
Access: Closed

Cite This

APA Style

Baochen Sun, Kate Saenko (2014). From Virtual to Reality: Fast Adaptation of Virtual Object Detectors to Real Domains. 82.1-82.12. https://doi.org/10.5244/c.28.82

Identifiers

DOI: 10.5244/c.28.82