Kolmogorov-Smirnov Test for Feature Selection in Emotion Recognition from Speech

Ivanou, Aliaksei; Riccardi, Giuseppe

2012-01-01

Abstract

Automatic emotion recognition from speech is limited by the ability to discover relevant predictive features. The common approach is to extract a very large set of features over a generally long analysis time window. In this paper we investigate the applicability of the two-sample Kolmogorov-Smirnov statistical test (KST) to the problem of segmental speech emotion recognition. We train emotion classifiers for each speech segment within an utterance. The segment labels are then combined to predict the dominant emotion label of the utterance. Our findings show that KST can be successfully used to extract statistically relevant features. The KST criterion is used to optimize the parameters of the statistical segmental analysis, namely the segment window size and shift. We carry out seven binary-class emotion classification experiments on Emo-DB and evaluate the impact of the segmental analysis and emotion-specific feature selection.
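As a rough illustration of the idea in the abstract, the sketch below ranks features by the two-sample Kolmogorov-Smirnov statistic (the largest gap between the empirical distributions of a feature under two emotion classes) and then combines per-segment labels into one utterance label. It is not the authors' implementation: the function names, the feature names, the toy values, and the use of a simple majority vote to combine segment labels are all assumptions made for this example.

```python
from bisect import bisect_right
from collections import Counter


def ks_statistic(sample_a, sample_b):
    """Two-sample KS statistic: largest gap between the two empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    # The maximum gap is always attained at one of the observed values.
    return max(
        abs(bisect_right(a, x) / len(a) - bisect_right(b, x) / len(b))
        for x in a + b
    )


def rank_features(class_a, class_b):
    """Rank feature names by KS statistic, most class-separating first."""
    scores = {name: ks_statistic(class_a[name], class_b[name]) for name in class_a}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


def dominant_label(segment_labels):
    """Combine per-segment predictions by majority vote (an assumed rule)."""
    return Counter(segment_labels).most_common(1)[0][0]


# Hypothetical per-segment feature values for one binary task (not Emo-DB data).
anger = {
    "pitch_mean": [220.0, 240.0, 250.0, 260.0],
    "energy": [0.5, 0.6, 0.7, 0.8],
}
neutral = {
    "pitch_mean": [120.0, 130.0, 140.0, 150.0],
    "energy": [0.55, 0.6, 0.65, 0.75],
}

ranking = rank_features(anger, neutral)
for name, score in ranking:
    print(f"{name}: KS = {score:.2f}")

print(dominant_label(["anger", "neutral", "anger"]))
```

In this toy data the pitch values of the two classes do not overlap, so that feature gets the maximum statistic of 1 and ranks first, while the heavily overlapping energy values score much lower. A real selection step would typically also apply a significance threshold to the statistic, which this sketch leaves out.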

Filing date: 2012
Proceedings title: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Place of publication: Washington
Publisher: IEEE
ISBN: 9781467300469
DOI: https://dx.doi.org/10.1109/ICASSP.2012.6289074
Scopus identifier: 2-s2.0-84867593857
WOS identifier: WOS:000312381405050
All authors: Ivanou, Aliaksei; Riccardi, Giuseppe
Publication type: 04.3 Poster presented at Conference or Workshop

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/92177
