Optimized graph learning using partial tags and multiple features for image and video annotation

IRIS

In multimedia annotation, due to the time constraints and the tediousness of manual tagging, it is quite common to utilize both tagged and untagged data to improve the performance of supervised learning when only limited tagged training data are available. This is often done by adding a geometry-based regularization term in the objective function of a supervised learning model. In this case, a similarity graph is indispensable to exploit the geometrical relationships among the training data points, and the graph construction scheme essentially determines the performance of these graph-based learning algorithms. However, most of the existing works construct the graph empirically and are usually based on a single feature without using the label information. In this paper, we propose a semi-supervised annotation approach by learning an optimized graph (OGL) from multi-cues (i.e., partial tags and multiple features), which can more accurately embed the relationships among the data points. ...

Optimized graph learning using partial tags and multiple features for image and video annotation

Song, Jingkuan;Gao, L.;Nie, F.;Shen, H. T.;Yan, Yan;Sebe, Niculae

2016-01-01

Abstract

In multimedia annotation, due to the time constraints and the tediousness of manual tagging, it is quite common to utilize both tagged and untagged data to improve the performance of supervised learning when only limited tagged training data are available. This is often done by adding a geometry-based regularization term in the objective function of a supervised learning model. In this case, a similarity graph is indispensable to exploit the geometrical relationships among the training data points, and the graph construction scheme essentially determines the performance of these graph-based learning algorithms. However, most of the existing works construct the graph empirically and are usually based on a single feature without using the label information. In this paper, we propose a semi-supervised annotation approach by learning an optimized graph (OGL) from multi-cues (i.e., partial tags and multiple features), which can more accurately embed the relationships among the data points. ...

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2016
			
	Titolo del periodico (Journal title)
	
				IEEE TRANSACTIONS ON IMAGE PROCESSING
			
	Numero e parte del fascicolo (Issue number and part)
	
				11
			
	DOI
	
				https://dx.doi.org/10.1109/TIP.2016.2601260
			
	Codice Scopus (Scopus identifier)
	
				2-s2.0-84991618952
			
	Codice WOS (WOS identifier)
	
				WOS:000386148400001
			
	Tutti gli autori
	
						Song, Jingkuan; Gao, L.; Nie, F.; Shen, H. T.; Yan, Yan; Sebe, Niculae
					
	Appare nelle tipologie:
	
				03.1 Articolo su rivista (Journal article)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/166685

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

123

98

133

social impact