Optimal graph learning with partial tags and multiple features for image and video annotation

IRIS

In multimedia annotation, due to the time constraints and the tediousness of manual tagging, it is quite common to utilize both tagged and untagged data to improve the performance of supervised learning when only limited tagged training data are available. This is often done by adding a geometrically based regularization term in the objective function of a supervised learning model. In this case, a similarity graph is indispensable to exploit the geometrical relationships among the training data points, and the graph construction scheme essentially determines the performance of these graph-based learning algorithms. However, most of the existing works construct the graph empirically and are usually based on a single feature without using the label information. In this paper, we propose a semi-supervised annotation approach by learning an optimal graph (OGL) from multi-cues (i.e., partial tags and multiple features) which can more accurately embed the relationships among the data points. We further extend our model to address out-of-sample and noisy label issues. Extensive experiments on four public datasets show the consistent superiority of OGL over state-of-the-art methods by up to 12% in terms of mean average precision.
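As a rough illustration of the graph-regularized setup the abstract describes, the Python sketch below propagates a few known tags over a similarity graph. It is not the OGL method, whose formulation is not given on this page: the graph here is a plain average of per-feature Gaussian graphs built by hand, which is exactly the kind of empirical construction the paper aims to replace. The function names, the parameters `sigma` and `mu`, and the toy data are all assumptions made for the example.

```python
import numpy as np

def gaussian_affinity(X, sigma=1.0):
    """Dense Gaussian-kernel similarity matrix with a zero diagonal."""
    sq = np.sum(X**2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    W = np.exp(-d2 / (2.0 * sigma**2))
    np.fill_diagonal(W, 0.0)
    return W

def propagate_labels(feature_views, Y, labeled_mask, mu=100.0):
    """Minimize tr(F^T L F) + mu * ||F_l - Y_l||^2 over the score matrix F.

    The graph is the average of per-feature Gaussian graphs: a hand-built
    stand-in (assumption), not a learned optimal graph.
    """
    W = sum(gaussian_affinity(X) for X in feature_views) / len(feature_views)
    L = np.diag(W.sum(axis=1)) - W                 # unnormalized graph Laplacian
    U = np.diag(mu * labeled_mask.astype(float))   # fitting penalty on tagged points only
    # Setting the gradient to zero gives (L + U) F = U Y.
    return np.linalg.solve(L + U, U @ Y)

# Toy data (hypothetical): two well-separated clusters seen through two feature views.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [4.0, 4.0]])
true = np.repeat([0, 1], 10)
view_a = centers[true] + 0.2 * rng.standard_normal((20, 2))
view_b = centers[true] + 0.2 * rng.standard_normal((20, 2))

# Only one point per class carries a tag.
labeled_mask = np.zeros(20, dtype=bool)
labeled_mask[[0, 10]] = True
Y = np.zeros((20, 2))
Y[0, 0] = 1.0
Y[10, 1] = 1.0

F = propagate_labels([view_a, view_b], Y, labeled_mask)
pred = F.argmax(axis=1)   # predicted class for every point, tagged or not
```

The smoothness term `tr(F^T L F)` plays the role of the geometrically based regularizer: it penalizes score differences between points that the graph connects strongly, so the quality of `W` directly bounds the quality of the predictions.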

Optimal graph learning with partial tags and multiple features for image and video annotation / Gao, Lianli; Song, Jingkuan; Nie, Feiping; Yan, Yan; Sebe, Niculae; Shen, Heng Tao. - (2015), pp. 4371-4379. (Paper presented at the CVPR 2015 conference, held in Boston, 7th-12th June 2015) [10.1109/CVPR.2015.7299066].


Gao, Lianli; Song, Jingkuan; Nie, Feiping; Yan, Yan; Sebe, Niculae; Shen, Heng Tao

2015-01-01



Date of publication: 2015
Proceedings title: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Place of publication: Piscataway, NJ
Publisher: IEEE Computer Society
ISBN: 978-1-4673-6964-0
Scopus identifier: 2-s2.0-84959233699
WOS identifier: WOS:000387959204045
All authors: Gao, Lianli; Song, Jingkuan; Nie, Feiping; Yan, Yan; Sebe, Niculae; Shen, Heng Tao
Citation: Optimal graph learning with partial tags and multiple features for image and video annotation / Gao, Lianli; Song, Jingkuan; Nie, Feiping; Yan, Yan; Sebe, Niculae; Shen, Heng Tao. - (2015), pp. 4371-4379. (Paper presented at the CVPR 2015 conference, held in Boston, 7th-12th June 2015) [10.1109/CVPR.2015.7299066].
Appears in the following types: 04.1 Paper in Proceedings

Files in this item:

File	Size	Format
07299066.pdf (repository managers only; type: publisher's version (publisher's layout); license: all rights reserved)	588.77 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/115053
