In this work, we present novel strategies to coarsely describe indoor scenes by listing the objects surrounding a blind person equipped with a portable digital camera. They rely on a new multilabeling approach which consists in computing the similarity between a query image and a set of multilabeled images stored in a library in order to pick up the most similar images. Since each image of the library conveys its own list of objects, the co-occurrence of objects between the most similar images is exploited to "multilabel" the query image. The multilabeling approach is implemented by means of three different strategies. They are respectively based on the scale invariant feature transform (SIFT), the notion of bag of words, and principal component analysis (PCA). The proposed methods were tested on datasets corresponding to two different public indoor sites. Promising results have been obtained and suggest that near real-time implementation can be envisioned for describing public indoor ...

Toward an Assisted Indoor Scene Perception for Blind People with Image Multilabeling Strategies

Mekhalfi, Mohamed Lamine;Melgani, Farid;Bazi, Yakoub;
2015-01-01

Abstract

In this work, we present novel strategies to coarsely describe indoor scenes by listing the objects surrounding a blind person equipped with a portable digital camera. They rely on a new multilabeling approach which consists in computing the similarity between a query image and a set of multilabeled images stored in a library in order to pick up the most similar images. Since each image of the library conveys its own list of objects, the co-occurrence of objects between the most similar images is exploited to "multilabel" the query image. The multilabeling approach is implemented by means of three different strategies. They are respectively based on the scale invariant feature transform (SIFT), the notion of bag of words, and principal component analysis (PCA). The proposed methods were tested on datasets corresponding to two different public indoor sites. Promising results have been obtained and suggest that near real-time implementation can be envisioned for describing public indoor ...
2015
6
Mekhalfi, Mohamed Lamine; Melgani, Farid; Bazi, Yakoub; Alajlan, N.
File in questo prodotto:
File Dimensione Formato  
ESA-2015-Blind.pdf

Solo gestori archivio

Descrizione: Versione Finale Pubblicata
Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.99 MB
Formato Adobe PDF
2.99 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/115390
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 18
  • ???jsp.display-item.citation.isi??? 15
  • OpenAlex ND
social impact