Abstract

We propose a framework for extracting structure from stereo which represents the scene as a collection of approximately planar layers. Each layer consists of an explicit 3D plane equation, a colored image with per-pixel opacity (a sprite), and a per-pixel depth offset relative to the plane. Initial estimates of the layers are recovered using techniques taken from parametric motion estimation. These initial estimates are then refined using a re-synthesis algorithm which takes into account both occlusions and mixed pixels. Reasoning about such effects allows the recovery of depth and color information with high accuracy even in partially occluded regions. Another important benefit of our framework is that the output consists of a collection of approximately planar regions, a representation which is far more appropriate than a dense depth map for many applications such as rendering and video parsing.

Keywords

PixelComputer visionArtificial intelligenceRendering (computer graphics)Computer scienceOffset (computer science)PlanarDepth mapImage-based modeling and renderingStereo displayReal-time renderingParametric statisticsParametric equationComputer graphics (images)MathematicsImage (mathematics)Geometry

Affiliated Institutions

Related Publications

Publication Info

Year
2002
Type
article
Pages
434-441
Citations
177
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

177
OpenAlex

Cite This

Simon Baker, Richard Szeliski, P. Anandan (2002). A layered approach to stereo reconstruction. , 434-441. https://doi.org/10.1109/cvpr.1998.698642

Identifiers

DOI
10.1109/cvpr.1998.698642