Egocentric activity recognition has recently generated great popularity in computer vision due to its widespread applications in egocentric video analysis. However, it poses new challenges comparing to the conventional third-person activity recognition tasks, which are caused by significant body shaking, varied lengths, and poor recoding quality, etc. To handle these challenges, in this paper, we propose deep appearance and motion learning (DAML) for egocentric activity recognition, which leverages the great strength of deep learning networks in feature learning. In contrast to hand-crafted visual features or pre-trained convolutional neural network (CNN) features with limited generality to new egocentric videos, the proposed DAML is built on the deep autoencoder (DAE), and directly extracts appearance and motion feature, the main cue of activities, from egocentric videos. The DAML takes advantages of the great effectiveness and efficiency of the DAE in unsupervised feature learning, w...

Deep appearance and motion learning for egocentric activity recognition / Wang, Xuanhan; Gao, Lianli; Song, Jingkuan; Zhen, Xiantong; Sebe, Nicu; Shen, Heng Tao. - In: NEUROCOMPUTING. - ISSN 0925-2312. - 275:(2018), pp. 438-447. [10.1016/j.neucom.2017.08.063]

Deep appearance and motion learning for egocentric activity recognition

Song, Jingkuan;Sebe, Nicu;
2018-01-01

Abstract

Egocentric activity recognition has recently generated great popularity in computer vision due to its widespread applications in egocentric video analysis. However, it poses new challenges comparing to the conventional third-person activity recognition tasks, which are caused by significant body shaking, varied lengths, and poor recoding quality, etc. To handle these challenges, in this paper, we propose deep appearance and motion learning (DAML) for egocentric activity recognition, which leverages the great strength of deep learning networks in feature learning. In contrast to hand-crafted visual features or pre-trained convolutional neural network (CNN) features with limited generality to new egocentric videos, the proposed DAML is built on the deep autoencoder (DAE), and directly extracts appearance and motion feature, the main cue of activities, from egocentric videos. The DAML takes advantages of the great effectiveness and efficiency of the DAE in unsupervised feature learning, w...
2018
Wang, Xuanhan; Gao, Lianli; Song, Jingkuan; Zhen, Xiantong; Sebe, Nicu; Shen, Heng Tao
Deep appearance and motion learning for egocentric activity recognition / Wang, Xuanhan; Gao, Lianli; Song, Jingkuan; Zhen, Xiantong; Sebe, Nicu; Shen, Heng Tao. - In: NEUROCOMPUTING. - ISSN 0925-2312. - 275:(2018), pp. 438-447. [10.1016/j.neucom.2017.08.063]
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0925231217314935-main.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Altra licenza (Other type of license)
Dimensione 1.78 MB
Formato Adobe PDF
1.78 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/193327
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 43
  • ???jsp.display-item.citation.isi??? 36
  • OpenAlex ND
social impact