Recurrent Semantic Change Detection in VHR Remote Sensing Images Using Visual Foundation Models / Zhang, Jing; Ding, Lei; Zhou, Tingyuan; Wang, Jian; Atkinson, Peter M.; Bruzzone, Lorenzo. - In: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING. - ISSN 1558-0644. - 63:(2025), pp. 1-14. [10.1109/TGRS.2025.3546808]

Recurrent Semantic Change Detection in VHR Remote Sensing Images Using Visual Foundation Models

Jing Zhang; Lei Ding; Lorenzo Bruzzone
2025-01-01

Abstract

Semantic change detection (SCD) involves simultaneously extracting changed regions and their semantic classes (pre- and post-change) from remote sensing images (RSIs). Despite recent advances in vision foundation models (VFMs), the fast segment anything model (FastSAM) has shown insufficient performance in SCD. In this article, we propose a novel VFM-based architecture for SCD, designated VFM-ReSCD. The architecture integrates a side adapter (SA) to fine-tune the FastSAM network, enabling zero-shot transfer to novel image distributions and tasks. This enhancement facilitates the extraction of spatial features from very high-resolution (VHR) RSIs. Moreover, we introduce a recurrent neural network (RNN) to model semantic correlation and capture feature changes. We evaluated the proposed method on two benchmark RSI datasets. Extensive experiments show that it achieves state-of-the-art (SOTA) performance relative to existing approaches and outperforms other CNN-based methods on both datasets.
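
To make the task definition in the abstract concrete, the following minimal sketch shows how the outputs of SCD relate to each other: a binary change mask plus the pre- and post-change class of every changed pixel. It is not the VFM-ReSCD method, which the abstract describes only at a high level. The class ids, the tiny label maps, and the function name `semantic_change` are all assumptions made for illustration, and real pipelines would typically operate on arrays rather than nested lists.

```python
from collections import Counter

# Hypothetical class ids, chosen only for this illustration.
CLASSES = {0: "water", 1: "vegetation", 2: "building"}


def semantic_change(pre, post):
    """Return (change_mask, transitions) for two same-sized label maps.

    change_mask[r][c] is 1 where the class differs between the two dates.
    transitions counts (pre_class, post_class) pairs over changed pixels.
    """
    if len(pre) != len(post) or any(len(a) != len(b) for a, b in zip(pre, post)):
        raise ValueError("pre and post label maps must have the same shape")

    # Binary change map: 1 where the two dates disagree, 0 elsewhere.
    change_mask = [
        [int(a != b) for a, b in zip(row_pre, row_post)]
        for row_pre, row_post in zip(pre, post)
    ]

    # "From-to" statistics over the changed pixels only.
    transitions = Counter(
        (CLASSES[a], CLASSES[b])
        for row_pre, row_post in zip(pre, post)
        for a, b in zip(row_pre, row_post)
        if a != b
    )
    return change_mask, transitions


# Toy semantic maps for the two acquisition dates (assumed data).
pre = [[1, 1, 0],
       [1, 1, 0],
       [2, 2, 0]]
post = [[1, 2, 0],
        [2, 2, 0],
        [2, 2, 1]]

mask, transitions = semantic_change(pre, post)
for (before, after), count in sorted(transitions.items()):
    print(f"{before} -> {after}: {count} pixel(s)")
```

Here the change mask is derived by comparing two given label maps. An SCD model such as the one described above would instead predict the change regions and both semantic maps from the images themselves.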
Files in this item:

TGRS3546808.pdf
Access: open access
Description: Accepted version
Type: Refereed post-print (refereed author's manuscript)
License: All rights reserved
Size: 10.06 MB
Format: Adobe PDF

Recurrent_Semantic_Change_Detection_in_VHR_Remote_Sensing_Images_Using_Visual_Foundation_Models.pdf
Access: archive managers only
Type: Publisher's version (publisher's layout)
License: All rights reserved
Size: 2.52 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/475679
Citations
  • PubMed Central: not available
  • Scopus: 9
  • Web of Science (ISI): 11
  • OpenAlex: 7