Boosting VLAD with double assignment using deep features for action recognition in videos

Duta, Ionut C.; Nguyen, Tuan A.; Aizawa, Kiyoharu; Ionescu, Bogdan; Sebe, Nicu

doi:10.1109/ICPR.2016.7899964

The encoding method is an important factor for an action recognition pipeline. One of the key points for the encoding method is the assignment step. A very widely used super-vector encoding method is the vector of locally aggregated descriptors (VLAD), with very competitive results in many tasks. However, it considers only hard assignment and the criteria for the assignment is performed only from the features side, by looking for which visual word the features are voting. In this work we propose to encode deep features for videos using a double assignment VLAD (DA-VLAD). In addition to the traditional assignment for VLAD we perform a second assignment by taking into account the perspective from the codebook side: which are the nearest features to a visual word and not only which is the nearest centroid for the features as the standard assignment. Another important factor for the performance of an action recognition system is the feature extraction step. Recently, deep features obtained...

Boosting VLAD with double assignment using deep features for action recognition in videos / Duta, I.C., Nguyen, T.A., Aizawa, K., Ionescu, B., Sebe, N.. - 0:(2016), pp. 2210-2215. (23rd International Conference on Pattern Recognition, ICPR 2016 Cancun Center, mex 2016) [10.1109/ICPR.2016.7899964].

Boosting VLAD with double assignment using deep features for action recognition in videos

Duta, Ionut C.;Nguyen, Tuan A.;Aizawa, Kiyoharu;Ionescu, Bogdan;Sebe, Nicu

2016-01-01

Abstract

The encoding method is an important factor for an action recognition pipeline. One of the key points for the encoding method is the assignment step. A very widely used super-vector encoding method is the vector of locally aggregated descriptors (VLAD), with very competitive results in many tasks. However, it considers only hard assignment and the criteria for the assignment is performed only from the features side, by looking for which visual word the features are voting. In this work we propose to encode deep features for videos using a double assignment VLAD (DA-VLAD). In addition to the traditional assignment for VLAD we perform a second assignment by taking into account the perspective from the codebook side: which are the nearest features to a visual word and not only which is the nearest centroid for the features as the standard assignment. Another important factor for the performance of an action recognition system is the feature extraction step. Recently, deep features obtained...

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2016
			
	Titolo del volume (Proceedings title)
	
				Proceedings - International Conference on Pattern Recognition
			
	Luogo di edizione (Place of publication)
	
				Piscataway, NJ
			
	Casa editrice (Publisher)
	
				Institute of Electrical and Electronics Engineers Inc.
			
	ISBN
	
				9781509048472
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85019134771
			
	Codice WOS (WOS identifier)
	
				WOS:000406771302034
			
	Tutti gli autori
	
						Duta, Ionut C.; Nguyen, Tuan A.; Aizawa, Kiyoharu; Ionescu, Bogdan; Sebe, Nicu
					
	Citazione
	
				Boosting VLAD with double assignment using deep features for action recognition in videos / Duta, I.C., Nguyen, T.A., Aizawa, K., Ionescu, B., Sebe, N.. - 0:(2016), pp. 2210-2215. (23rd International Conference on Pattern Recognition, ICPR 2016 Cancun Center, mex 2016) [10.1109/ICPR.2016.7899964].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
07899964.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 333.61 kB Formato Adobe PDF Visualizza/Apri	333.61 kB	Adobe PDF	Visualizza/Apri