Supervised classification algorithms require a sufficiently large set of representative training samples to generate accurate land-cover maps. Collecting reference data is difficult, expensive, and infeasible at large scale. To solve this problem, this article introduces a novel approach that aims to extract reliable labeled data from existing thematic products. Although these products represent a potentially useful information source, their use is not straightforward. First, they are not completely reliable, since they may contain classification errors. Second, they are typically aggregated at the polygon level, where polygons do not necessarily correspond to homogeneous areas. Finally, there is usually a semantic gap between map legends and remote sensing (RS) data. In this context, we propose an approach that aims to: 1) perform domain understanding to detect the discrepancies between the thematic map domain and the RS data domain; 2) use RS data contemporary to the map to decompose the thematic product from the semantic and spatial viewpoints; and 3) extract a database of informative and reliable training samples. The database of weakly labeled units is used to train an ensemble of classifiers on recent data, whose results are then combined by a majority voting rule. Two sets of experimental results obtained on multispectral (MS) images by extracting training samples from a crop type map and the 2018 Corine Land Cover (CLC) map, respectively, confirm the effectiveness of the proposed approach.
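The final combination step mentioned above, a majority voting rule over the outputs of an ensemble, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `majority_vote`, the example labels, and the tie-breaking rule (the earliest classifier's vote wins) are assumptions made only for illustration.

```python
from collections import Counter
from typing import Hashable, List, Sequence


def majority_vote(predictions: Sequence[Sequence[Hashable]]) -> List[Hashable]:
    """Combine per-classifier label predictions by majority voting.

    predictions[k][i] is the label that classifier k assigns to sample i.
    Ties go to the label voted first (lowest classifier index); this
    tie-break is an assumption of the sketch, not taken from the article.
    """
    if not predictions:
        raise ValueError("need at least one classifier")
    n_samples = len(predictions[0])
    if any(len(p) != n_samples for p in predictions):
        raise ValueError("all classifiers must label the same samples")
    combined = []
    for votes in zip(*predictions):
        # most_common(1) returns the label with the highest count; among
        # equal counts the first-seen label is returned.
        label, _ = Counter(votes).most_common(1)[0]
        combined.append(label)
    return combined


# Hypothetical outputs of three ensemble members on four pixels.
ensemble_predictions = [
    ["wheat", "maize", "water", "maize"],
    ["wheat", "wheat", "water", "forest"],
    ["maize", "maize", "water", "wheat"],
]
print(majority_vote(ensemble_predictions))
```

In this toy case the last pixel receives three different votes, so the result depends entirely on the assumed tie-break; a real system might instead weight votes by classifier confidence.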
A Novel Approach to the Unsupervised Extraction of Reliable Training Samples From Thematic Products / Paris, Claudia; Bruzzone, Lorenzo. - In: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING. - ISSN 0196-2892. - 59:3(2021), pp. 1930-1948. [10.1109/tgrs.2020.3001004]
A Novel Approach to the Unsupervised Extraction of Reliable Training Samples From Thematic Products
Paris, Claudia;Bruzzone, Lorenzo
2021-01-01
File | Access | Type | License | Size | Format
---|---|---|---|---|---
A_Novel_Approach_to_the_Unsupervised_Extraction_of_Reliable_Training_Samples_From_Thematic_Products_compressed.pdf | Archive managers only | Publisher's layout | All rights reserved | 3.67 MB | Adobe PDF
3-parisbruzzone_compressed.pdf | Open access | Refereed author's manuscript (post-print) | All rights reserved | 3.37 MB | Adobe PDF