The encoding method is an important factor for an action recognition pipeline. One of the key points for the encoding method is the assignment step. A very widely used super-vector encoding method is the vector of locally aggregated descriptors (VLAD), with very competitive results in many tasks. However, it considers only hard assignment and the criteria for the assignment is performed only from the features side, by looking for which visual word the features are voting. In this work we propose to encode deep features for videos using a double assignment VLAD (DA-VLAD). In addition to the traditional assignment for VLAD we perform a second assignment by taking into account the perspective from the codebook side: which are the nearest features to a visual word and not only which is the nearest centroid for the features as the standard assignment. Another important factor for the performance of an action recognition system is the feature extraction step. Recently, deep features obtained...

Boosting VLAD with double assignment using deep features for action recognition in videos / Duta, Ionut C.; Nguyen, Tuan A.; Aizawa, Kiyoharu; Ionescu, Bogdan; Sebe, Nicu. - 0:(2016), pp. 2210-2215. ( 23rd International Conference on Pattern Recognition, ICPR 2016 Cancun Center, mex 2016) [10.1109/ICPR.2016.7899964].

Boosting VLAD with double assignment using deep features for action recognition in videos

Duta, Ionut C.;Sebe, Nicu
2016-01-01

Abstract

The encoding method is an important factor for an action recognition pipeline. One of the key points for the encoding method is the assignment step. A very widely used super-vector encoding method is the vector of locally aggregated descriptors (VLAD), with very competitive results in many tasks. However, it considers only hard assignment and the criteria for the assignment is performed only from the features side, by looking for which visual word the features are voting. In this work we propose to encode deep features for videos using a double assignment VLAD (DA-VLAD). In addition to the traditional assignment for VLAD we perform a second assignment by taking into account the perspective from the codebook side: which are the nearest features to a visual word and not only which is the nearest centroid for the features as the standard assignment. Another important factor for the performance of an action recognition system is the feature extraction step. Recently, deep features obtained...
2016
Proceedings - International Conference on Pattern Recognition
Piscataway, NJ
Institute of Electrical and Electronics Engineers Inc.
9781509048472
Duta, Ionut C.; Nguyen, Tuan A.; Aizawa, Kiyoharu; Ionescu, Bogdan; Sebe, Nicu
Boosting VLAD with double assignment using deep features for action recognition in videos / Duta, Ionut C.; Nguyen, Tuan A.; Aizawa, Kiyoharu; Ionescu, Bogdan; Sebe, Nicu. - 0:(2016), pp. 2210-2215. ( 23rd International Conference on Pattern Recognition, ICPR 2016 Cancun Center, mex 2016) [10.1109/ICPR.2016.7899964].
File in questo prodotto:
File Dimensione Formato  
07899964.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 333.61 kB
Formato Adobe PDF
333.61 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/193368
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 16
  • ???jsp.display-item.citation.isi??? 11
  • OpenAlex ND
social impact