This paper presents a fast and robust framework integrating local features for the matching of multimodal geospatial data (e.g., optical, LiDAR, SAR and map). In the proposed framework, local feature descriptors, such as Histogram of Oriented Gradient (HOG) and Local Self Similarity (LSS), are first extracted for every pixel to form a pixel-wise structural feature representation of an image. Then we define a similarity metric based on the feature representation in frequency domain using the 3 Dimensional Fast Fourier Transform (3DFFT) technique, followed by a template matching scheme to detect control points between multimodal data. The proposed framework is based on the hypothesis that structural similarity between images is preserved across different modalities. The major advantages of this framework include (1) structural similarity representation using pixel-wise feature description and (2) high computational efficiency due to the use of 3DFFT. Experimental results on different types of multimodal geospatial data show more accurate matching performance of the proposed framework than the state-of-the-art methods.

Fast and robust structure-based multimodal geospatial image matching / Ye, Yuanxin; Bruzzone, Lorenzo; Shan, Jie; Shen, Li. - ELETTRONICO. - (2017), pp. 5141-5144. (Intervento presentato al convegno IGARSS 2017 tenutosi a Fort Worth, Texas, USA nel 23-28 July 2017) [10.1109/IGARSS.2017.8128160].

Fast and robust structure-based multimodal geospatial image matching

Ye, Yuanxin;Bruzzone, Lorenzo;
2017-01-01

Abstract

This paper presents a fast and robust framework integrating local features for the matching of multimodal geospatial data (e.g., optical, LiDAR, SAR and map). In the proposed framework, local feature descriptors, such as Histogram of Oriented Gradient (HOG) and Local Self Similarity (LSS), are first extracted for every pixel to form a pixel-wise structural feature representation of an image. Then we define a similarity metric based on the feature representation in frequency domain using the 3 Dimensional Fast Fourier Transform (3DFFT) technique, followed by a template matching scheme to detect control points between multimodal data. The proposed framework is based on the hypothesis that structural similarity between images is preserved across different modalities. The major advantages of this framework include (1) structural similarity representation using pixel-wise feature description and (2) high computational efficiency due to the use of 3DFFT. Experimental results on different types of multimodal geospatial data show more accurate matching performance of the proposed framework than the state-of-the-art methods.
2017
2017 IEEE International Geoscience & Remote Sensing Symposium Proceedings
Piscataway, NJ
IEEE
978-1-5090-4951-6
Ye, Yuanxin; Bruzzone, Lorenzo; Shan, Jie; Shen, Li
Fast and robust structure-based multimodal geospatial image matching / Ye, Yuanxin; Bruzzone, Lorenzo; Shan, Jie; Shen, Li. - ELETTRONICO. - (2017), pp. 5141-5144. (Intervento presentato al convegno IGARSS 2017 tenutosi a Fort Worth, Texas, USA nel 23-28 July 2017) [10.1109/IGARSS.2017.8128160].
File in questo prodotto:
File Dimensione Formato  
Fast and robust structure-based multimodal geospatial image matching.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.05 MB
Formato Adobe PDF
1.05 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/193927
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 8
social impact