In this work, we propose a Frequency-based Action Descriptor (FADE) to represent human actions. In robotics, with the development of Programming by Demonstration (PbD) methods, representing and recognizing large sets of actions has become crucial to build autonomous systems that learn from humans. The FADE descriptor leverages Fast Fourier Transform (FFT) for action representation and is combined with the Manhattan distance for measuring similarities between actions. It is characterized by a low time and space complexity and is particularly suitable for classification of human actions. For clustering problems, we propose a modified version of FADE, called Uncompressed-FADE (U-FADE), which performs well in combination with Spectral Clustering algorithms at the price of a reduced compression. We compare FADE with action descriptors based on Singular Value Decomposition (SVD) and Hidden Markov Models (HMM) on the entire HDM05 motion capture database. Despite the high dimensionality of the problem, we obtained on the entire database a promising recognition rate of 78% combining FADE with a simple 1-NN classification algorithm. Furthermore, we achieved a rate of 98% on a small action set and 88% on a medium action set.

Encoding Human Actions with a Frequency Domain Approach / Shah, D., Falco, P., Saveriano, M., Lee, D.. - (2016), pp. 5304-5311. (2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2016 Daejeon, Korea October 9-14, 2016) [10.1109/IROS.2016.7759780].

Encoding Human Actions with a Frequency Domain Approach

Saveriano M.;
2016-01-01

Abstract

In this work, we propose a Frequency-based Action Descriptor (FADE) to represent human actions. In robotics, with the development of Programming by Demonstration (PbD) methods, representing and recognizing large sets of actions has become crucial to build autonomous systems that learn from humans. The FADE descriptor leverages Fast Fourier Transform (FFT) for action representation and is combined with the Manhattan distance for measuring similarities between actions. It is characterized by a low time and space complexity and is particularly suitable for classification of human actions. For clustering problems, we propose a modified version of FADE, called Uncompressed-FADE (U-FADE), which performs well in combination with Spectral Clustering algorithms at the price of a reduced compression. We compare FADE with action descriptors based on Singular Value Decomposition (SVD) and Hidden Markov Models (HMM) on the entire HDM05 motion capture database. Despite the high dimensionality of the problem, we obtained on the entire database a promising recognition rate of 78% combining FADE with a simple 1-NN classification algorithm. Furthermore, we achieved a rate of 98% on a small action set and 88% on a medium action set.
2016
2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Piscataway, New Jersey, USA
Institute of Electrical and Electronics Engineers Inc.
978-1-5090-3762-9
Shah, D.; Falco, P.; Saveriano, M.; Lee, D.
Encoding Human Actions with a Frequency Domain Approach / Shah, D., Falco, P., Saveriano, M., Lee, D.. - (2016), pp. 5304-5311. (2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2016 Daejeon, Korea October 9-14, 2016) [10.1109/IROS.2016.7759780].
File in questo prodotto:
File Dimensione Formato  
Encoding_human_actions_with_a_frequency_domain_approach.pdf

Solo gestori archivio

Descrizione: 2016 IEEE/RSJ - conference paper
Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 517.07 kB
Formato Adobe PDF
517.07 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/331031
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 6
  • OpenAlex 15
social impact