Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation / Xu, Dan; Wang, Wei; Tang, Hao; Liu, Hong; Sebe, Nicu; Ricci, Elisa. - (2018), pp. 3917-3925. (Paper presented at the CVPR conference, held in Salt Lake City, UT, USA, 18-23 June 2018) [10.1109/CVPR.2018.00412].
Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation
Xu, Dan; Wang, Wei; Tang, Hao; Sebe, Nicu; Ricci, Elisa
2018-01-01
Abstract
Recent works have shown the benefit of integrating Conditional Random Field (CRF) models into deep architectures for improving pixel-level prediction tasks. Following this line of research, in this paper we introduce a novel approach for monocular depth estimation. As in previous works, our method employs a continuous CRF to fuse multi-scale information derived from different layers of a front-end Convolutional Neural Network (CNN). Unlike past works, our approach benefits from a structured attention model which automatically regulates the amount of information transferred between corresponding features at different scales. Importantly, the proposed attention model is seamlessly integrated into the CRF, allowing end-to-end training of the entire architecture. Our extensive experimental evaluation demonstrates the effectiveness of the proposed method, which is competitive with previous methods on the KITTI benchmark and outperforms the state of the art on the NYU Depth V2 dataset.
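To make the idea of attention-regulated multi-scale fusion concrete, the following is a minimal, hypothetical PyTorch sketch. It is not the authors' released implementation and omits the continuous CRF and its mean-field inference entirely; the class name `AttentionGatedFusion`, the per-scale sigmoid gates, and all shapes are assumptions introduced only to illustrate how learned per-pixel gates can control how much each scale contributes to a fused depth prediction.

```python
# Illustrative sketch only (assumed module, not the paper's CRF): attention-gated
# fusion of multi-scale CNN features for a single-channel depth-like output.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionGatedFusion(nn.Module):
    def __init__(self, channels: int, num_scales: int):
        super().__init__()
        # One 1x1 convolution per scale produces a per-pixel gate in [0, 1].
        self.gates = nn.ModuleList(
            nn.Conv2d(channels, 1, kernel_size=1) for _ in range(num_scales)
        )
        # Simple prediction head in place of the paper's CRF inference.
        self.head = nn.Conv2d(channels, 1, kernel_size=3, padding=1)

    def forward(self, features):
        # `features`: list of tensors [B, C, H_s, W_s] from different CNN layers.
        target_size = features[0].shape[-2:]
        fused = 0.0
        for feat, gate in zip(features, self.gates):
            feat = F.interpolate(feat, size=target_size, mode="bilinear",
                                 align_corners=False)
            a = torch.sigmoid(gate(feat))   # attention map: one weight per pixel
            fused = fused + a * feat        # gate how much this scale contributes
        return self.head(fused)             # [B, 1, H, W] depth-like map

# Usage with random multi-scale features (shapes are illustrative):
feats = [torch.randn(1, 64, 60, 80),
         torch.randn(1, 64, 30, 40),
         torch.randn(1, 64, 15, 20)]
depth = AttentionGatedFusion(channels=64, num_scales=3)(feats)
print(depth.shape)  # torch.Size([1, 1, 60, 80])
```

In the paper, the analogous gating variables are structured attention terms estimated jointly with the depth variables inside the CRF, rather than independent sigmoid outputs as in this sketch.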
| File | Access | Type | License | Size | Format |
|---|---|---|---|---|---|
| structured.pdf | Repository managers only | Publisher's version (publisher's layout) | All rights reserved | 2.22 MB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.