Deep convolutional neural fields for depth estimation from a single image

Abstract

We consider the problem of depth estimation from a sin- gle monocular image in this work. It is a challenging task as no reliable depth cues are available, e.g., stereo corre- spondences, motions etc. Previous efforts have been focus- ing on exploiting geometric priors or additional sources of information, with all using hand-crafted features. Recently, there is mounting evidence that features from deep convo- lutional neural networks (CNN) are setting new records for various vision applications. On the other hand, considering the continuous characteristic of the depth values, depth esti- mations can be naturally formulated into a continuous con- ditional random field (CRF) learning problem. Therefore, we in this paper present a deep convolutional neural field model for estimating depths from a single image, aiming to jointly explore the capacity of deep CNN and continuous CRF. Specifically, we propose a deep structured learning scheme which learns the unary and pairwise potentials of continuous CRF in a unified deep CNN framework. The proposed method can be used for depth estimations of general scenes with no geometric priors nor any extra in- formation injected. In our case, the integral of the partition function can be analytically calculated, thus we can exactly solve the log-likelihood optimization. Moreover, solving the MAP problem for predicting depths of a new image is highly efficient as closed-form solutions exist. We experimentally demonstrate that the proposed method outperforms state-of- the-art depth estimation methods on both indoor and out- door scene datasets.

Keywords

Conditional random fieldArtificial intelligenceConvolutional neural networkDeep learningComputer scienceUnary operationPrior probabilityPairwise comparisonImage (mathematics)Pattern recognition (psychology)Depth mapComputer visionMathematicsBayesian probability

Affiliated Institutions

The University of Adelaide AU

Related Publications

Conditional Random Fields as Recurrent Neural Networks

Shuai Zheng , Sadeep Jayasumana , Bernardino Romera‐Paredes +5 more

Pixel-level labelling tasks, such as semantic segmentation, play a central role in image understanding. Recent approaches have attempted to harness the capabilities of deep lear...

2015 2381 citations

UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation

Zongwei Zhou , Md Mahfuzur Rahman Siddiquee , Nima Tajbakhsh +1 more

The state-of-the-art models for medical image segmentation are variants of U-Net and fully convolutional networks (FCN). Despite their success, these models have two limitations...

2019 IEEE Transactions on Medical Imaging 3567 citations

Image Super-Resolution Using Deep Convolutional Networks

Chao Dong , Chen Change Loy , Kaiming He +1 more

We propose a deep learning method for single image super-resolution (SR). Our method directly learns an end-to-end mapping between the low/high-resolution images. The mapping is...

2015 IEEE Transactions on Pattern Analysis... 9271 citations

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick , Jeff Donahue , Trevor Darrell +1 more

Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that...

2014 30615 citations

Representation Learning: A Review and New Perspectives

Yoshua Bengio , Aaron Courville , P. M. Durai Raj Vincent

The success of machine learning algorithms generally depends on data representation, and we hypothesize that this is because different representations can entangle and hide more...

2013 IEEE Transactions on Pattern Analysis... 12373 citations

Publication Info

Year: 2015
Type: article
Citations: 863
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Deep convolutional neural fields for depth estimation from a single image

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

863

OpenAlex

Cite This

APA Style

                            
                                    Fayao Liu, 
                                
                                    Chunhua Shen, 
                                
                                    Guosheng Lin
                                
                            (2015). 
                            Deep convolutional neural fields for depth estimation from a single image. 
                            
                            .
                            https://doi.org/10.1109/cvpr.2015.7299152

Identifiers

DOI: 10.1109/cvpr.2015.7299152