MSCD-Net: From Unimodal to Multimodal Semantic Change Detection

Wang, Jian; Xie, Hong; Yan, Li; Zhou, Tingyuan; Wang, Yanheng; Zhang, Jing; Bruzzone, Lorenzo; Atkinson, Peter M.

doi:10.1109/TGRS.2025.3591814

Semantic change detection (SCD) involves temporal changes and spatial semantics. Its working principle and processing flow usually include land semantic segmentation (LSS) and binary change detection (BCD). Due to its significant impact and practical value, SCD has received consistently wide attention in Earth observation. Nowadays, remote sensing (RS) data in various modalities are proliferating, calling for an urgent need to develop intelligent algorithms for multimodal RS data. However, no efficient multimodal SCD methods exist currently. To address this limitation, this work proposes the first deep learning-based multimodal SCD method: MSCD-Net. MSCD-Net extracts multiscale semantic and difference features after fusing multimodal features, and then aggregates and refines these features to output high-quality semantic segmentation and change maps. Additionally, a semantic difference decoder (SDD) module is designed to model semantic and difference features jointly. It can be integrated with existing methods to increase accuracy. Experimental results demonstrate that MSCD-Net achieves state-of-the-art performance on both multimodal and unimodal SCD datasets, and SDD has strong feature learning ability and compatibility. These findings imply that MSCD-Net is expected to promote the development and application of multimodal SCD.

MSCD-Net: From Unimodal to Multimodal Semantic Change Detection / Wang, J., Xie, H., Yan, L.i., Zhou, T., Wang, Y., Zhang, J., Bruzzone, L., Atkinson, P.M.. - In: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING. - ISSN 1558-0644. - 63:4508017(2025), pp. 1-17. [10.1109/TGRS.2025.3591814]

MSCD-Net: From Unimodal to Multimodal Semantic Change Detection

Jian Wang;Hong Xie;Li Yan;Tingyuan Zhou;Yanheng Wang;Jing Zhang;Lorenzo Bruzzone;Peter M. Atkinson

2025-01-01

Abstract

Semantic change detection (SCD) involves temporal changes and spatial semantics. Its working principle and processing flow usually include land semantic segmentation (LSS) and binary change detection (BCD). Due to its significant impact and practical value, SCD has received consistently wide attention in Earth observation. Nowadays, remote sensing (RS) data in various modalities are proliferating, calling for an urgent need to develop intelligent algorithms for multimodal RS data. However, no efficient multimodal SCD methods exist currently. To address this limitation, this work proposes the first deep learning-based multimodal SCD method: MSCD-Net. MSCD-Net extracts multiscale semantic and difference features after fusing multimodal features, and then aggregates and refines these features to output high-quality semantic segmentation and change maps. Additionally, a semantic difference decoder (SDD) module is designed to model semantic and difference features jointly. It can be integrated with existing methods to increase accuracy. Experimental results demonstrate that MSCD-Net achieves state-of-the-art performance on both multimodal and unimodal SCD datasets, and SDD has strong feature learning ability and compatibility. These findings imply that MSCD-Net is expected to promote the development and application of multimodal SCD.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2025
			
	Titolo del periodico (Journal title)
	
				IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
			
	Numero e parte del fascicolo (Issue number and part)
	
				4508017
			
	DOI
	
				https://dx.doi.org/10.1109/TGRS.2025.3591814
			
	Codice Scopus (Scopus identifier)
	
				2-s2.0-105011957466
			
	Codice WOS (WOS identifier)
	
				WOS:001567061200029
			
	Tutti gli autori
	
						Wang, Jian; Xie, Hong; Yan, Li; Zhou, Tingyuan; Wang, Yanheng; Zhang, Jing; Bruzzone, Lorenzo; Atkinson, Peter M.
					
	Citazione
	
				MSCD-Net: From Unimodal to Multimodal Semantic Change Detection / Wang, J., Xie, H., Yan, L.i., Zhou, T., Wang, Y., Zhang, J., Bruzzone, L., Atkinson, P.M.. - In: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING. - ISSN 1558-0644. - 63:4508017(2025), pp. 1-17. [10.1109/TGRS.2025.3591814]
			
	Appare nelle tipologie:
	
				03.1 Articolo su rivista (Journal article)

File in questo prodotto:

File	Dimensione	Formato
TGRS3591814.pdf accesso aperto Descrizione: This article has been accepted for publication in IEEE Transactions on Geoscience and Remote Sensing. This is the author's version which has not been fully edited and content may change prior to final publication. Citation information: DOI 10.1109/TGRS.2025.3591814 Tipologia: Post-print referato (Refereed author’s manuscript) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 26.16 MB Formato Adobe PDF Visualizza/Apri	26.16 MB	Adobe PDF	Visualizza/Apri
MSCD-Net_From_Unimodal_to_Multimodal_Semantic_Change_Detection.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 14.21 MB Formato Adobe PDF Visualizza/Apri	14.21 MB	Adobe PDF	Visualizza/Apri
MSCD-Net_From_Unimodal_to_Multimodal_Semantic_Change_Detection_compressed (1).pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 3.97 MB Formato Adobe PDF Visualizza/Apri	3.97 MB	Adobe PDF	Visualizza/Apri