CV-C3D: Action recognition on compressed videos with convolutional 3D networks

Dos Santos, S. F.; Sebe, N.; Almeida, J.

doi:10.1109/SIBGRAPI.2019.00012

Action recognition in videos has gained substantial attention from the computer vision community due to the wide range of possible applications. Recent works have addressed this problem with deep learning methods. The main limitation of existing approaches is their difficulty to learn temporal dynamics due to the high computational load demanded for processing huge amounts of data required to train a model. To overcome this problem, we propose a Compressed Video Convolutional 3D network (CV-C3D). It exploits information from the compressed representation of a video in order to avoid the high computational cost for fully decoding the video stream. The speed up of the computation enables our network to use 3D convolutions for capturing the temporal context efficiently. Our network has the lowest computational complexity among all the compared approaches. Results of our approach in the task of action recognition on two public benchmarks, UCF-101 and HMDB-51, were comparable to the baselines, with the advantage of running at faster inference speed.

CV-C3D: Action recognition on compressed videos with convolutional 3D networks / Dos Santos, S. F.; Sebe, N.; Almeida, J.. - (2019), pp. 24-30. (Intervento presentato al convegno 32nd SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI 2019 tenutosi a CINCO - Riocentro Convention and Event Center, bra nel 2019) [10.1109/SIBGRAPI.2019.00012].

CV-C3D: Action recognition on compressed videos with convolutional 3D networks

Dos Santos S. F.;Sebe N.;Almeida J.

2019-01-01

Abstract

Action recognition in videos has gained substantial attention from the computer vision community due to the wide range of possible applications. Recent works have addressed this problem with deep learning methods. The main limitation of existing approaches is their difficulty to learn temporal dynamics due to the high computational load demanded for processing huge amounts of data required to train a model. To overcome this problem, we propose a Compressed Video Convolutional 3D network (CV-C3D). It exploits information from the compressed representation of a video in order to avoid the high computational cost for fully decoding the video stream. The speed up of the computation enables our network to use 3D convolutions for capturing the temporal context efficiently. Our network has the lowest computational complexity among all the compared approaches. Results of our approach in the task of action recognition on two public benchmarks, UCF-101 and HMDB-51, were comparable to the baselines, with the advantage of running at faster inference speed.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2019
			
	Titolo del volume (Proceedings title)
	
				Proceedings - 32nd Conference on Graphics, Patterns and Images, SIBGRAPI 2019
			
	Luogo di edizione (Place of publication)
	
				New York
			
	Casa editrice (Publisher)
	
				Institute of Electrical and Electronics Engineers Inc.
			
	ISBN
	
				978-1-7281-5227-1
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85077063318
			
	Codice WOS (WOS identifier)
	
				WOS:000521826400004
			
	Tutti gli autori
	
						Dos Santos, S. F.; Sebe, N.; Almeida, J.
					
	Citazione
	
				CV-C3D: Action recognition on compressed videos with convolutional 3D networks / Dos Santos, S. F.; Sebe, N.; Almeida, J.. - (2019), pp. 24-30. (Intervento presentato al  convegno 32nd SIBGRAPI Conference on Graphics, Patterns and Images, SIBGRAPI 2019 tenutosi a CINCO - Riocentro Convention and Event Center, bra nel 2019) [10.1109/SIBGRAPI.2019.00012].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
08919874.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 318.54 kB Formato Adobe PDF Visualizza/Apri	318.54 kB	Adobe PDF	Visualizza/Apri