Depth and surface normal estimation from monocular images using regression on deep features and hierarchical CRFs

Bo Li (University of Adelaide), Chunhua Shen (Australian Centre for Robotic Vision), Yuchao Dai (Australian National University), Anton van den Hengel (University of Adelaide), Mingyi He (Northwestern Polytechnical University)
June 1, 2015
Cited by 645

Abstract

Predicting the depth (or surface normal) of a scene from a single monocular color image is a challenging task. This paper tackles this essentially underdetermined problem by regression on deep convolutional neural network (DCNN) features, combined with a post-processing refinement step using conditional random fields (CRFs). Our framework works at two levels: the super-pixel level and the pixel level. First, we design a DCNN model to learn the mapping from multi-scale image patches to depth or surface normal values at the super-pixel level. Second, the estimated super-pixel depth or surface normal is refined to the pixel level by exploiting various potentials on the depth or surface normal map, which include a data term, a smoothness term among super-pixels, and an auto-regression term characterizing the local structure of the estimation map. The inference problem can be solved efficiently because it admits a closed-form solution. Experiments on the Make3D and NYU Depth V2 datasets show competitive results compared with recent state-of-the-art methods.
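
To illustrate why a refinement energy of this kind can admit a closed-form solution, the following is a minimal sketch, not the paper's actual formulation. It assumes a single-level energy with only a quadratic data term and a quadratic pairwise smoothness term over a small graph of super-pixels; the function name `refine_depth`, the edge weights, and the trade-off parameter `lam` are hypothetical, and the auto-regression term and the pixel-level hierarchy are omitted. Because the energy is quadratic in the unknown depths, setting its gradient to zero yields a linear system.

```python
import numpy as np

def refine_depth(z, edges, weights, lam=1.0):
    """Minimize  sum_i (d_i - z_i)^2 + lam * sum_(i,j) w_ij * (d_i - d_j)^2.

    z       : initial per-node depth estimates (e.g. regressed values)
    edges   : list of (i, j) index pairs for neighbouring nodes
    weights : one non-negative smoothness weight per edge (assumed given)
    lam     : hypothetical trade-off between data and smoothness terms
    """
    z = np.asarray(z, dtype=float)
    n = len(z)
    # Build the weighted graph Laplacian L, so that
    # sum_(i,j) w_ij (d_i - d_j)^2 == d^T L d.
    L = np.zeros((n, n))
    for (i, j), w in zip(edges, weights):
        L[i, i] += w
        L[j, j] += w
        L[i, j] -= w
        L[j, i] -= w
    # Zero gradient: 2 (d - z) + 2 lam L d = 0  =>  (I + lam L) d = z.
    A = np.eye(n) + lam * L
    return np.linalg.solve(A, z)

# Toy example: four nodes in a chain, with a weak link across a depth edge.
z = np.array([1.0, 1.2, 3.0, 3.1])
edges = [(0, 1), (1, 2), (2, 3)]
weights = [1.0, 0.1, 1.0]
d = refine_depth(z, edges, weights, lam=2.0)
print(d)
```

The system matrix `I + lam * L` is symmetric positive definite, so the solve is always well posed. A low weight on the middle edge in the toy example lets the depth discontinuity survive while the neighbouring estimates on either side are pulled together.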

