In this work, we propose a Frequency-based Action Descriptor (FADE) to represent human actions. In robotics, with the development of Programming by Demonstration (PbD) methods, representing and recognizing large sets of actions has become crucial to build autonomous systems that learn from humans. The FADE descriptor leverages Fast Fourier Transform (FFT) for action representation and is combined with the Manhattan distance for measuring similarities between actions. It is characterized by a low time and space complexity and is particularly suitable for classification of human actions. For clustering problems, we propose a modified version of FADE, called Uncompressed-FADE (U-FADE), which performs well in combination with Spectral Clustering algorithms at the price of a reduced compression. We compare FADE with action descriptors based on Singular Value Decomposition (SVD) and Hidden Markov Models (HMM) on the entire HDM05 motion capture database. Despite the high dimensionality of the problem, we obtained on the entire database a promising recognition rate of 78% combining FADE with a simple 1-NN classification algorithm. Furthermore, we achieved a rate of 98% on a small action set and 88% on a medium action set.

Encoding human actions with a frequency domain approach / Shah, D.; Falco, P.; Saveriano, M.; Lee, D.. - 2016-:(2016), pp. 5304-5311. (Intervento presentato al convegno 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2016 tenutosi a Daejeon Convention Center, kor nel 2016) [10.1109/IROS.2016.7759780].

Encoding human actions with a frequency domain approach

Saveriano M.;
2016-01-01

Abstract

In this work, we propose a Frequency-based Action Descriptor (FADE) to represent human actions. In robotics, with the development of Programming by Demonstration (PbD) methods, representing and recognizing large sets of actions has become crucial to build autonomous systems that learn from humans. The FADE descriptor leverages Fast Fourier Transform (FFT) for action representation and is combined with the Manhattan distance for measuring similarities between actions. It is characterized by a low time and space complexity and is particularly suitable for classification of human actions. For clustering problems, we propose a modified version of FADE, called Uncompressed-FADE (U-FADE), which performs well in combination with Spectral Clustering algorithms at the price of a reduced compression. We compare FADE with action descriptors based on Singular Value Decomposition (SVD) and Hidden Markov Models (HMM) on the entire HDM05 motion capture database. Despite the high dimensionality of the problem, we obtained on the entire database a promising recognition rate of 78% combining FADE with a simple 1-NN classification algorithm. Furthermore, we achieved a rate of 98% on a small action set and 88% on a medium action set.
2016
IEEE International Conference on Intelligent Robots and Systems
Piscataway, New Jersey, USA
Institute of Electrical and Electronics Engineers Inc.
978-1-5090-3762-9
Shah, D.; Falco, P.; Saveriano, M.; Lee, D.
Encoding human actions with a frequency domain approach / Shah, D.; Falco, P.; Saveriano, M.; Lee, D.. - 2016-:(2016), pp. 5304-5311. (Intervento presentato al convegno 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2016 tenutosi a Daejeon Convention Center, kor nel 2016) [10.1109/IROS.2016.7759780].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/331031
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 6
social impact