Recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance. However, they require costly ground truth annotations during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps. We introduce a new network architecture, named Progressive Fusion Network (PFN), that is specifically designed for binocular stereo depth estimation. This network is based on a multi-scale refinement strategy that combines the information provided by both stereo views. In addition, we propose to stack this network twice in order to form a cycle. This cycle approach can be interpreted as a form of data augmentation since, at training time, the network learns both from the training set images (in the forward half-cycle) and from the synthesized images (in the backward half-cycle). The architecture is jointly trained with adversarial learning. Extensive experiments on the publicly available datasets KITTI, Cityscapes and ApolloScape demonstrate the effectiveness of the proposed model, which is competitive with other unsupervised deep learning methods for depth prediction.
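The cycle idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `warp_horizontal` and `cycle_losses`, the L1 photometric loss, the `x + disparity` sampling convention, and the use of fixed disparity arrays in place of network predictions are all assumptions made for illustration. The sketch only shows how a forward half-cycle compares a synthesized view with a real image, while a backward half-cycle starts from the synthesized image.

```python
import numpy as np


def warp_horizontal(image, disparity):
    """Resample each row of a grayscale image at x + disparity[y, x].

    Hypothetical helper: assumes rectified stereo, so pixels move only
    along rows. Samples falling outside the image are clamped to the border.
    """
    h, w = image.shape
    xs = np.arange(w, dtype=np.float64)
    out = np.empty((h, w), dtype=np.float64)
    for y in range(h):
        src = np.clip(xs + disparity[y], 0, w - 1)
        # Linear interpolation between neighbouring pixels of the row.
        out[y] = np.interp(src, xs, image[y])
    return out


def cycle_losses(left, right, disp_fwd, disp_bwd):
    """Return (forward, backward) L1 reconstruction losses.

    disp_fwd and disp_bwd stand in for the outputs of the two stacked
    networks; here they are plain arrays (an assumption of this sketch).
    """
    # Forward half-cycle: synthesize the right view from the real left image.
    right_syn = warp_horizontal(left, disp_fwd)
    forward = np.mean(np.abs(right_syn - right))
    # Backward half-cycle: start from the synthesized image and
    # reconstruct the left view, closing the cycle.
    left_rec = warp_horizontal(right_syn, disp_bwd)
    backward = np.mean(np.abs(left_rec - left))
    return forward, backward
```

In a training setup along these lines, both terms would be minimized together, so the second pass learns from images the first pass produced rather than only from the dataset.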

Progressive Fusion for Unsupervised Binocular Depth Estimation Using Cycled Networks / Pilzer, Andrea; Lathuiliere, Stephane; Xu, Dan; Puscas, Mihai Marian; Ricci, Elisa; Sebe, Nicu. - In: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE. - ISSN 0162-8828. - 42:10(2020), pp. 2380-2395. [10.1109/TPAMI.2019.2942928]

Progressive Fusion for Unsupervised Binocular Depth Estimation Using Cycled Networks

Pilzer, Andrea; Lathuiliere, Stephane; Xu, Dan; Puscas, Mihai Marian; Ricci, Elisa; Sebe, Nicu
2020-01-01

Files in this record:
File: 08846077.pdf
Access: Archive administrators only
Type: Publisher’s version (Publisher’s layout)
License: All rights reserved
Size: 5.94 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/274467