An Interactive Strategy for the Training Set Definition Based on Active Self-Paced Learning Implemented on a Cloud-Computing Platform

Paris, Claudia; Orlandi, Luca; Bruzzone, Lorenzo

doi:10.1109/LGRS.2021.3114611

Supervised classification of remote sensing data requires a large number of high-quality annotated samples. At the operational level, the definition of a large training set by photograph interpretation is costly and time-consuming. The manual annotation activity is typically supported by high-resolution satellite data. Therefore, when working at country or continental scale, it is necessary to efficiently access large archives of remotely sensed data. To address these issues, this letter presents an interactive strategy implemented in a cloud-computing platform for defining effective training sets with significantly reduced human effort. This is achieved by combining active learning (AL) and self-paced learning (SPL) techniques. First, an initial training set is used to classify the pool of unlabeled samples. Then, the method progressively adds high-confidence samples, selected through an SPL strategy, and low-confidence samples selected considering an AL strategy. While the high-confidence sample labels are self-paced, the low-confidence ones are manually assigned. The cloud-computing platform allows the: 1) definition of a complete training set in a fast and efficient way and 2) access to a multipetabyte catalog of satellite imagery. Experiments carried out on the Google Earth Engine (GEE) Platform demonstrate the effectiveness of the proposed strategy compared to the standard manual annotation.

An Interactive Strategy for the Training Set Definition Based on Active Self-Paced Learning Implemented on a Cloud-Computing Platform / Paris, Claudia; Orlandi, Luca; Bruzzone, Lorenzo. - In: IEEE GEOSCIENCE AND REMOTE SENSING LETTERS. - ISSN 1545-598X. - 19:(2022), pp. 1-5. [10.1109/LGRS.2021.3114611]

An Interactive Strategy for the Training Set Definition Based on Active Self-Paced Learning Implemented on a Cloud-Computing Platform

Paris, Claudia;Orlandi, Luca;Bruzzone, Lorenzo

2022-01-01

Abstract

Supervised classification of remote sensing data requires a large number of high-quality annotated samples. At the operational level, the definition of a large training set by photograph interpretation is costly and time-consuming. The manual annotation activity is typically supported by high-resolution satellite data. Therefore, when working at country or continental scale, it is necessary to efficiently access large archives of remotely sensed data. To address these issues, this letter presents an interactive strategy implemented in a cloud-computing platform for defining effective training sets with significantly reduced human effort. This is achieved by combining active learning (AL) and self-paced learning (SPL) techniques. First, an initial training set is used to classify the pool of unlabeled samples. Then, the method progressively adds high-confidence samples, selected through an SPL strategy, and low-confidence samples selected considering an AL strategy. While the high-confidence sample labels are self-paced, the low-confidence ones are manually assigned. The cloud-computing platform allows the: 1) definition of a complete training set in a fast and efficient way and 2) access to a multipetabyte catalog of satellite imagery. Experiments carried out on the Google Earth Engine (GEE) Platform demonstrate the effectiveness of the proposed strategy compared to the standard manual annotation.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2022
			
	Titolo del periodico (Journal title)
	
				IEEE GEOSCIENCE AND REMOTE SENSING LETTERS
			
	DOI
	
				https://dx.doi.org/10.1109/LGRS.2021.3114611
			
	Codice Scopus (Scopus identifier)
	
				2-s2.0-85117116266
			
	Codice WOS (WOS identifier)
	
				WOS:000732099900001
			
	Tutti gli autori
	
						Paris, Claudia; Orlandi, Luca; Bruzzone, Lorenzo
					
	Citazione
	
				An Interactive Strategy for the Training Set Definition Based on Active Self-Paced Learning Implemented on a Cloud-Computing Platform / Paris, Claudia; Orlandi, Luca; Bruzzone, Lorenzo. - In: IEEE GEOSCIENCE AND REMOTE SENSING LETTERS. - ISSN 1545-598X. - 19:(2022), pp. 1-5. [10.1109/LGRS.2021.3114611]
			
	Appare nelle tipologie:
	
				03.1 Articolo su rivista (Journal article)

File in questo prodotto:

File	Dimensione	Formato
3-GRSL-01258-2021.pdf Solo gestori archivio Tipologia: Pre-print non referato (Non-refereed preprint) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 8.99 MB Formato Adobe PDF Visualizza/Apri	8.99 MB	Adobe PDF	Visualizza/Apri
An_Interactive_Strategy_for_the_Training_Set_Definition_Based_on_Active_Self-Paced_Learning_Implemented_on_a_Cloud-Computing_Platform(1).pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 5.27 MB Formato Adobe PDF Visualizza/Apri	5.27 MB	Adobe PDF	Visualizza/Apri