Abstract

3D scene understanding is key for the success of applications such as autonomous driving and robot navigation. However, existing approaches either produce a mild level of understanding, e.g., segmentation, object detection, or are not accurate enough for these applications, e.g., 3D pop-ups. In this paper we propose a principled generative model of 3D urban scenes that takes into account dependencies between static and dynamic features. We derive a reversible jump MCMC scheme that is able to infer the geometric (e.g., street orientation) and topological (e.g., number of intersecting streets) properties of the scene layout, as well as the semantic activities occurring in the scene, e.g., traffic situations at an intersection. Furthermore, we show that this global level of understanding provides the context necessary to disambiguate current state-of-the-art detectors. We demonstrate the effectiveness of our approach on a dataset composed of short stereo video sequences of 113 different scenes captured by a car driving around a mid-size city.

Keywords

Computer scienceArtificial intelligenceComputer visionIntersection (aeronautics)Context (archaeology)SegmentationGenerative modelDroneKey (lock)Object (grammar)RobotPedestrianGenerative grammarObject detectionGeographyCartography

Affiliated Institutions

Related Publications

A ConvNet for the 2020s

The "Roaring 20s" of visual recognition began with the introduction of Vision Transformers (ViTs), which quickly superseded ConvNets as the state-of-the-art image classification...

2022 2022 IEEE/CVF Conference on Computer ... 5683 citations

Publication Info

Year
2011
Type
article
Volume
220
Pages
1945-1952
Citations
57
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

57
OpenAlex

Cite This

Andreas Geiger, Martin Lauer, Raquel Urtasun (2011). A generative model for 3D urban scene understanding from movable platforms. , 220 , 1945-1952. https://doi.org/10.1109/cvpr.2011.5995641

Identifiers

DOI
10.1109/cvpr.2011.5995641