—Depth cues have been proved very useful in various computer vision and robotic tasks. This paper addresses the problem of monocular depth estimation from a single still image. Inspired by the effectiveness of recent works on multi-scale convolutional neural networks (CNN), we propose a deep model which fuses complementary information derived from multiple CNN side outputs. Different from previous methods using concatenation or weighted average schemes, the integration is obtained by means of continuous Conditional Random Fields (CRFs). In particular, we propose two different variations, one based on a cascade of multiple CRFs, the other on a unified graphical model. By designing a novel CNN implementation of mean-field updates for continuous CRFs, we show that both proposed models can be regarded as sequential deep networks and that training can be performed end-to-end. Through an extensive experimental evaluation, we demonstrate the effectiveness of the proposed approach and establish new state of the art results for the monocular depth estimation task on three publicly available datasets, i.e., NYUD-V2, Make3D and KITTI.

Monocular Depth Estimation using Multi-Scale Continuous CRFs as Sequential Deep Networks / Xu, Dan; Ricci, Elisa; Ouyang, Wanli; Wang, Xiaogang; Sebe, Nicu. - In: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE. - ISSN 0162-8828. - 2019:(2019), pp. 1426-1440. [10.1109/TPAMI.2018.2839602]

Monocular Depth Estimation using Multi-Scale Continuous CRFs as Sequential Deep Networks

Xu, Dan;Ricci, Elisa;Sebe, Nicu
2019-01-01

Abstract

—Depth cues have been proved very useful in various computer vision and robotic tasks. This paper addresses the problem of monocular depth estimation from a single still image. Inspired by the effectiveness of recent works on multi-scale convolutional neural networks (CNN), we propose a deep model which fuses complementary information derived from multiple CNN side outputs. Different from previous methods using concatenation or weighted average schemes, the integration is obtained by means of continuous Conditional Random Fields (CRFs). In particular, we propose two different variations, one based on a cascade of multiple CRFs, the other on a unified graphical model. By designing a novel CNN implementation of mean-field updates for continuous CRFs, we show that both proposed models can be regarded as sequential deep networks and that training can be performed end-to-end. Through an extensive experimental evaluation, we demonstrate the effectiveness of the proposed approach and establish new state of the art results for the monocular depth estimation task on three publicly available datasets, i.e., NYUD-V2, Make3D and KITTI.
2019
Xu, Dan; Ricci, Elisa; Ouyang, Wanli; Wang, Xiaogang; Sebe, Nicu
Monocular Depth Estimation using Multi-Scale Continuous CRFs as Sequential Deep Networks / Xu, Dan; Ricci, Elisa; Ouyang, Wanli; Wang, Xiaogang; Sebe, Nicu. - In: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE. - ISSN 0162-8828. - 2019:(2019), pp. 1426-1440. [10.1109/TPAMI.2018.2839602]
File in questo prodotto:
File Dimensione Formato  
Monocular_Depth_Estimation_Using_Multi-Scale_Continuous_CRFs_as_Sequential_Deep_Networks.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 5.36 MB
Formato Adobe PDF
5.36 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/225333
Citazioni
  • ???jsp.display-item.citation.pmc??? 4
  • Scopus 63
  • ???jsp.display-item.citation.isi??? 44
social impact