Boosting Deep Open World Recognition by Clustering

Fontanel, D.; Cermelli, F.; Mancini, M.; Bulo, S. R.; Ricci, E.; Caputo, B.

doi:10.1109/LRA.2020.3010753

While convolutional neural networks have brought significant advances in robot vision, their ability is often limited to closed world scenarios, where the number of semantic concepts to be recognized is determined by the available training set. Since it is practically impossible to capture all possible semantic concepts present in the real world in a single training set, we need to break the closed world assumption, equipping our robot with the capability to act in an open world. To provide such ability, a robot vision system should be able to (i) identify whether an instance does not belong to the set of known categories (i.e., open set recognition), and (ii) extend its knowledge to learn new classes over time (i.e., incremental learning). In this work, we show how we can boost the performance of deep open world recognition algorithms by means of a new loss formulation enforcing a global to local clustering of class-specific features. In particular, a first loss term, i.e., global clustering, forces the network to map samples closer to the class centroid they belong to while the second one, local clustering, shapes the representation space in such a way that samples of the same class get closer in the representation space while pushing away neighbours belonging to other classes. Moreover, we propose a strategy to learn class-specific rejection thresholds, instead of heuristically estimating a single global threshold, as in previous works. Experiments on three benchmarks show the effectiveness of our approach.
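The abstract gives no formulas, so the following is only a minimal NumPy sketch of the two ideas it names: pulling samples toward their class centroid, and rejecting a sample as unknown using a per-class threshold. Everything in it is an illustrative assumption rather than the authors' formulation: the function names, the use of a softmax over negative squared distances as the clustering loss, and the hand-set thresholds (the paper learns them). The local clustering term is not sketched.

```python
import numpy as np

def class_centroids(features, labels, num_classes):
    """Mean feature vector per class (assumes every class has at least one sample)."""
    return np.stack([features[labels == c].mean(axis=0) for c in range(num_classes)])

def global_clustering_loss(features, labels, centroids):
    """Cross-entropy over negative squared distances to centroids (assumed form)."""
    # Squared Euclidean distance from every sample to every centroid: shape (N, C).
    d2 = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=-1)
    logits = -d2
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()

def predict_with_rejection(features, centroids, thresholds):
    """Nearest-centroid label, or -1 (unknown) if farther than that class's threshold."""
    d = np.linalg.norm(features[:, None, :] - centroids[None, :, :], axis=-1)
    nearest = d.argmin(axis=1)
    nearest_dist = d[np.arange(len(features)), nearest]
    return np.where(nearest_dist <= thresholds[nearest], nearest, -1)

# Toy 2-D features for two known classes (hypothetical data).
train_x = np.array([[0.0, 0.0], [0.2, 0.0], [4.0, 4.0], [4.0, 4.2]])
train_y = np.array([0, 0, 1, 1])
centroids = class_centroids(train_x, train_y, num_classes=2)

loss_correct = global_clustering_loss(train_x, train_y, centroids)
loss_swapped = global_clustering_loss(train_x, 1 - train_y, centroids)

# Per-class thresholds set by hand here; the paper proposes learning them.
thresholds = np.array([1.0, 1.0])
queries = np.array([[0.1, 0.1], [4.1, 4.0], [10.0, -10.0]])
predictions = predict_with_rejection(queries, centroids, thresholds)
```

In this toy setup the loss is close to zero when samples sit near their own centroid and grows when labels are swapped, and the third query, far from both centroids, is rejected as unknown. Keeping one threshold per class lets a compact class reject more aggressively than a spread-out one, which a single global threshold cannot do.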

Boosting Deep Open World Recognition by Clustering / Fontanel, D., Cermelli, F., Mancini, M., Bulo, S.R., Ricci, E., Caputo, B. - In: IEEE ROBOTICS AND AUTOMATION LETTERS. - ISSN 2377-3766. - 5:4(2020), pp. 5985-5992. [10.1109/LRA.2020.3010753]

2020-01-01

Date of publication	2020
Journal title	IEEE ROBOTICS AND AUTOMATION LETTERS
Issue number and part	4
DOI	https://dx.doi.org/10.1109/LRA.2020.3010753
Scopus identifier	2-s2.0-85089352603
WOS identifier	WOS:000554894900022
Publication type	03.1 Journal article

Files in this record:

File	Size	Format
09145605.pdf (Publisher's layout; all rights reserved; access restricted to archive managers)	1.12 MB	Adobe PDF