Multi-scale context aggregation for semantic segmentation of remote sensing images

Multi-scale context aggregation for semantic segmentation of remote sensing images / Zhang, J.; Lin, S.; Ding, L.; Bruzzone, L. - In: REMOTE SENSING. - ISSN 2072-4292. - 12:4(2020), pp. 70101-70116. [10.3390/rs12040701]

Zhang J.; Lin S.; Ding L.; Bruzzone L.

2020-01-01

Abstract

The semantic segmentation of remote sensing images (RSIs) is important in a variety of applications. Conventional encoder-decoder convolutional neural networks (CNNs) aggregate semantic information through cascaded pooling operations, which reduces localization accuracy and discards spatial details. To overcome these limitations, we use the high-resolution network (HRNet) to produce high-resolution features without a decoding stage. Moreover, we separately enhance the low-to-high-resolution features extracted from the different branches to strengthen the embedding of scale-related contextual information. The low-resolution features contain more semantic information and have a small spatial size; thus, they are used to model long-range spatial correlations. The high-resolution branches are enhanced with an adaptive spatial pooling (ASP) module that aggregates more local context. Combining these context aggregation designs across levels yields an architecture that exploits spatial context at both global and local levels. Experimental results on two RSI datasets show that our approach significantly improves accuracy over commonly used CNNs and achieves state-of-the-art performance.
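
The two kinds of context aggregation named in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the abstract does not define the ASP module, so `local_context` below substitutes plain average pooling over a fixed grid, and `global_context` uses a generic non-local (softmax attention) weighting over all positions of a small map. Both function names, the single-channel list-of-lists feature maps, and the `grid` parameter are assumptions made only for illustration.

```python
import math


def global_context(feat):
    """Hypothetical stand-in for low-resolution (global) aggregation.

    Every output value is a softmax-weighted average of all positions,
    where the weight is the similarity (product of values) between the
    query position and each other position.
    """
    h, w = len(feat), len(feat[0])
    flat = [v for row in feat for v in row]
    out = []
    for q in flat:
        scores = [q * k for k in flat]
        m = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        out.append(sum(wt * v for wt, v in zip(weights, flat)) / total)
    return [out[r * w:(r + 1) * w] for r in range(h)]


def local_context(feat, grid):
    """Hypothetical stand-in for high-resolution (local) aggregation.

    The map is split into grid x grid cells; each pixel receives the
    mean of its own cell as a residual. Assumes grid <= height and
    grid <= width so that no cell is empty.
    """
    h, w = len(feat), len(feat[0])
    out = [row[:] for row in feat]
    for gi in range(grid):
        r0, r1 = gi * h // grid, (gi + 1) * h // grid
        for gj in range(grid):
            c0, c1 = gj * w // grid, (gj + 1) * w // grid
            cell = [feat[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            mean = sum(cell) / len(cell)
            for r in range(r0, r1):
                for c in range(c0, c1):
                    out[r][c] = feat[r][c] + mean
    return out


# Toy inputs: a 2x2 low-resolution map and a 4x4 high-resolution map.
low_res = [[0.0, 1.0], [1.0, 0.0]]
high_res = [[1.0, 1.0, 3.0, 3.0],
            [1.0, 1.0, 3.0, 3.0],
            [5.0, 5.0, 7.0, 7.0],
            [5.0, 5.0, 7.0, 7.0]]
global_feat = global_context(low_res)
local_feat = local_context(high_res, grid=2)
```

The pairwise weighting in `global_context` costs on the order of the squared number of positions, which is one reason such long-range modelling is usually applied to the small low-resolution branch, while the cheaper cell-wise pooling suits the large high-resolution maps.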


Date of publication	2020
Journal title	REMOTE SENSING
Issue number and part	4
DOI	https://dx.doi.org/10.3390/rs12040701
Scopus identifier	2-s2.0-85080866180
WOS identifier	WOS:000519564600112
All authors	Zhang, J.; Lin, S.; Ding, L.; Bruzzone, L.
Appears in publication types	03.1 Articolo su rivista (Journal article)

Files in this record:

File	Size	Format
RemoteSensing_2020.02_Multi-Scale Context Aggregation for Semantic Segmentation of Remote Sensing Images.pdf (open access; type: publisher's layout; license: Creative Commons)	6.95 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/279195
