This paper presents a novel remote sensing (RS) image retrieval system that is defined based on generation and exploitation of textual descriptions that model the content of RS images. The proposed RS image retrieval system is composed of three main steps. The first one generates textual descriptions of the content of the RS images combining a convolutional neural network (CNN) and a recurrent neural network (RNN) to extract the features of the images and to generate the descriptions of their content, respectively. The second step encodes the semantic content of the generated descriptions using word embedding techniques able to produce semantically rich word vectors. The third step retrieves the most similar images with respect to the query image by measuring the similarity between the encoded generated textual descriptions of the query image and those of the archive. Experimental results on RS image archive composed of RS images acquired by unmanned aerial vehicles (UAVs) are reported and discussed.

Retrieving Images with Generated Textual Descriptions / Hoxha, Genc; Melgani, F.; Demir, B.. - (2019), pp. 5812-5815. (Intervento presentato al convegno 39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019 tenutosi a Convention Center "Pacifico Yokohama", jpn nel 2019) [10.1109/IGARSS.2019.8899321].

Retrieving Images with Generated Textual Descriptions

Hoxha, Genc;Melgani F.;Demir B.
2019-01-01

Abstract

This paper presents a novel remote sensing (RS) image retrieval system that is defined based on generation and exploitation of textual descriptions that model the content of RS images. The proposed RS image retrieval system is composed of three main steps. The first one generates textual descriptions of the content of the RS images combining a convolutional neural network (CNN) and a recurrent neural network (RNN) to extract the features of the images and to generate the descriptions of their content, respectively. The second step encodes the semantic content of the generated descriptions using word embedding techniques able to produce semantically rich word vectors. The third step retrieves the most similar images with respect to the query image by measuring the similarity between the encoded generated textual descriptions of the query image and those of the archive. Experimental results on RS image archive composed of RS images acquired by unmanned aerial vehicles (UAVs) are reported and discussed.
2019
International Geoscience and Remote Sensing Symposium (IGARSS)
New York, USA
Institute of Electrical and Electronics Engineers Inc.
978-1-5386-9154-0
Hoxha, Genc; Melgani, F.; Demir, B.
Retrieving Images with Generated Textual Descriptions / Hoxha, Genc; Melgani, F.; Demir, B.. - (2019), pp. 5812-5815. (Intervento presentato al convegno 39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019 tenutosi a Convention Center "Pacifico Yokohama", jpn nel 2019) [10.1109/IGARSS.2019.8899321].
File in questo prodotto:
File Dimensione Formato  
Paper.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 705.57 kB
Formato Adobe PDF
705.57 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/250875
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 3
  • OpenAlex ND
social impact