Beyond Pixel-Level Annotation: Exploring Self-Supervised Learning for Change Detection With Image-Level Supervision

Zhao, Maofan; Hu, Xinli; Zhang, Linlin; Meng, Qingyan; Chen, Yuxing; Bruzzone, Lorenzo

doi:10.1109/TGRS.2024.3379431

Change detection (CD) in high-resolution remote sensing has received considerable attention because of its wide range of applications. Many methods have been proposed in the literature and achieve excellent performance. However, most are fully supervised and therefore require abundant pixel-level labeled samples, which are time-consuming and labor-intensive to produce. Labeling bi-temporal images is often more complicated than the common single-temporal interpretation. This study therefore adopts weakly supervised learning (WSL) to reduce label acquisition costs. However, changed regions are small, fragmented, and similar to the background, which widens the gap between weakly supervised and fully supervised tasks. To address these difficulties, we explore self-supervised methods to construct a WSL framework based on image-level labels for general CD, termed WSLCD in this article. First, we design a double-branch Siamese network that takes the original image pair and the spatially transformed image pair as input and derives embeddings and initial class attention maps (CAMs). Second, mutual learning and equivariant regularization (MLER) are enforced on CAMs from different views, which imposes consistency constraints in confusion regions and lets the CAMs learn from each other based on saliency regions. Furthermore, prototype-based contrastive learning (PCL) is designed so that unreliable pixels can learn from prototypes computed from reliable pixels. PCL includes intraview contrast and cross-view contrast, depending on whether the prototypes and class embeddings come from the same view. With the above strategies, we narrow the gap between image-level weakly supervised CD and fully supervised CD. Experiments are conducted on three CD datasets: CLCD, DSIFN, and GCD. Our method achieves state-of-the-art performance on pseudo-label generation and CD. The code is available at https://github.com/mfzhao1998/WSLCD.
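The equivariant regularization mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the spatial transform, the distance, or the network, so the horizontal flip, the mean absolute difference, and the names `toy_cam`, `hflip`, and `equivariance_loss` are all assumptions chosen for illustration. The idea shown is only that a CAM computed on a transformed image pair should match the transformed CAM of the original pair.

```python
import numpy as np


def hflip(x: np.ndarray) -> np.ndarray:
    """Horizontal flip along the width axis (assumed spatial transform)."""
    return np.flip(x, axis=1)


def toy_cam(img_a: np.ndarray, img_b: np.ndarray, bias: np.ndarray) -> np.ndarray:
    """Hypothetical stand-in for a Siamese network's change CAM.

    img_a, img_b: H x W x C images of the same scene at two dates.
    bias: H x W position-dependent offset; a non-symmetric bias makes
    the toy model violate flip equivariance.
    """
    diff = np.abs(img_a - img_b).mean(axis=-1)   # per-pixel change score
    return 1.0 / (1.0 + np.exp(-(diff + bias)))  # squash to (0, 1)


def equivariance_loss(img_a: np.ndarray, img_b: np.ndarray, bias: np.ndarray) -> float:
    """Mean absolute gap between T(CAM(x)) and CAM(T(x)) for T = hflip."""
    cam_original = toy_cam(img_a, img_b, bias)
    cam_transformed_view = toy_cam(hflip(img_a), hflip(img_b), bias)
    return float(np.mean(np.abs(hflip(cam_original) - cam_transformed_view)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a = rng.random((8, 8, 3))
    b = rng.random((8, 8, 3))
    ramp = np.tile(np.linspace(-1.0, 1.0, 8), (8, 1))  # left-to-right bias
    print("equivariant model:", equivariance_loss(a, b, np.zeros((8, 8))))
    print("position-biased model:", equivariance_loss(a, b, ramp))
```

In this toy setting the loss vanishes when the CAM depends only on pixel content and becomes positive once the model has a position-dependent bias. A consistency term of this kind penalizes exactly that sort of disagreement between views.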

Beyond Pixel-Level Annotation: Exploring Self-Supervised Learning for Change Detection With Image-Level Supervision / Zhao, Maofan; Hu, Xinli; Zhang, Linlin; Meng, Qingyan; Chen, Yuxing; Bruzzone, Lorenzo. - In: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING. - ISSN 0196-2892. - 62:5614916(2024), pp. 1-16. [10.1109/TGRS.2024.3379431]

2024-01-01

Date of publication: 2024
Journal title: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
Issue number and part: 5614916
DOI: https://dx.doi.org/10.1109/TGRS.2024.3379431
Scopus identifier: 2-s2.0-85188554320
WOS identifier: WOS:001206028600010
All authors: Zhao, Maofan; Hu, Xinli; Zhang, Linlin; Meng, Qingyan; Chen, Yuxing; Bruzzone, Lorenzo
Citation: Beyond Pixel-Level Annotation: Exploring Self-Supervised Learning for Change Detection With Image-Level Supervision / Zhao, Maofan; Hu, Xinli; Zhang, Linlin; Meng, Qingyan; Chen, Yuxing; Bruzzone, Lorenzo. - In: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING. - ISSN 0196-2892. - 62:5614916(2024), pp. 1-16. [10.1109/TGRS.2024.3379431]
Appears in types: 03.1 Journal article

Files in this record:

File	Size	Format
TGRS3379431.pdf (embargoed until 19/03/2026; type: refereed author's manuscript, post-print; license: all rights reserved)	26.95 MB	Adobe PDF
Beyond_Pixel-Level_Annotation_Exploring_Self-Supervised_Learning_for_Change_Detection_With_Image-Level_Supervision.pdf (archive managers only; type: publisher's layout; license: all rights reserved)	5.37 MB	Adobe PDF