Real-time indoor scene description for the visually impaired using autoencoder fusion strategies with visible cameras

Malek, Salim; Melgani, Farid; Mekhalfi, Mohamed Lamine; Bazi, Yakoub

doi:10.3390/s17112641

This paper describes three coarse image description strategies intended to give visually impaired individuals a rough perception of surrounding objects in indoor spaces. The described algorithms operate on images acquired by the user with a chest-mounted camera and output a list of objects that are likely present in the indoor scene around the user. First, different colour-, texture-, and shape-based features are extracted, followed by a feature learning step by means of AutoEncoder (AE) models. Second, the learned features are fused and fed into a multilabel classifier that lists the potential objects. The experiments show that fusing a set of AE-learned features yields higher classification rates than using the features individually. Furthermore, with respect to reference works, our method: (i) yields higher classification accuracies, and (ii) runs (at least four times) faster,...
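
The pipeline outlined in the abstract (per-feature AE learning, fusion, multilabel listing) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not specify the three fusion strategies, so plain concatenation of the AE codes is an assumption here, as are the feature dimensions, the random stand-in data, the label names, and the untrained placeholder classifier weights.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class TinyAutoEncoder:
    """One-hidden-layer sigmoid autoencoder trained with plain SGD (illustrative only)."""

    def __init__(self, n_in, n_hidden, rng):
        self.W1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.W2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)] for _ in range(n_in)]
        self.b2 = [0.0] * n_in

    def encode(self, x):
        return [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
                for row, b in zip(self.W1, self.b1)]

    def decode(self, h):
        return [sigmoid(sum(w * hi for w, hi in zip(row, h)) + b)
                for row, b in zip(self.W2, self.b2)]

    def train_step(self, x, lr=0.5):
        """One SGD step on the squared reconstruction error for a single sample."""
        h = self.encode(x)
        y = self.decode(h)
        # Error signals at the output and hidden pre-activations.
        d_out = [(yi - xi) * yi * (1.0 - yi) for yi, xi in zip(y, x)]
        d_hid = [hj * (1.0 - hj) * sum(d_out[i] * self.W2[i][j] for i in range(len(x)))
                 for j, hj in enumerate(h)]
        for i, d in enumerate(d_out):
            for j, hj in enumerate(h):
                self.W2[i][j] -= lr * d * hj
            self.b2[i] -= lr * d
        for j, d in enumerate(d_hid):
            for i, xi in enumerate(x):
                self.W1[j][i] -= lr * d * xi
            self.b1[j] -= lr * d


def fuse_features(encoders, features):
    """Assumed fusion strategy: concatenate the AE code of each feature type."""
    fused = []
    for name, ae in encoders.items():
        fused.extend(ae.encode(features[name]))
    return fused


def predict_labels(fused, weights, biases, names, threshold=0.5):
    """Multilabel decision: one independent sigmoid score per object label."""
    scores = [sigmoid(sum(w * f for w, f in zip(row, fused)) + b)
              for row, b in zip(weights, biases)]
    return [n for n, s in zip(names, scores) if s >= threshold]


rng = random.Random(0)
# Hypothetical (input size, code size) per feature type.
dims = {"colour": (6, 3), "texture": (4, 2), "shape": (5, 2)}
encoders = {}
for name, (n_in, n_hidden) in dims.items():
    ae = TinyAutoEncoder(n_in, n_hidden, rng)
    samples = [[rng.random() for _ in range(n_in)] for _ in range(10)]  # stand-in data
    for _ in range(30):
        for x in samples:
            ae.train_step(x)
    encoders[name] = ae

# Features of one hypothetical image, again random stand-ins.
image_features = {name: [rng.random() for _ in range(n_in)]
                  for name, (n_in, _) in dims.items()}
fused = fuse_features(encoders, image_features)

# Placeholder (untrained) classifier: zero weights, so only the biases decide.
label_names = ["chair", "door"]  # hypothetical labels
label_weights = [[0.0] * len(fused) for _ in label_names]
label_biases = [2.0, -2.0]
predicted = predict_labels(fused, label_weights, label_biases, label_names)
```

In a real system the classifier weights would be learned from labelled images and the threshold tuned on validation data; the sketch only shows how the per-feature codes flow into a single multilabel decision.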

Real-time indoor scene description for the visually impaired using autoencoder fusion strategies with visible cameras / Malek, Salim; Melgani, Farid; Mekhalfi, Mohamed Lamine; Bazi, Yakoub. - In: SENSORS. - ISSN 1424-8220. - 17:11(2017), pp. 264101-264114. [10.3390/s17112641]

2017-01-01

Date of publication: 2017
Journal title: SENSORS
Issue number and part: 11
DOI: https://dx.doi.org/10.3390/s17112641
PubMed identifier: 29144395
Scopus identifier: 2-s2.0-85034836119
WOS identifier: WOS:000416790500203
Appears in types: 03.1 Journal article

Files in this item:

File	Access	Type	Licence	Size	Format
Sensors-2017-Blind.pdf	Open access	Publisher's layout	All rights reserved	2.59 MB	Adobe PDF