Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages

IRIS

One of the first steps in building a spoken language understanding (SLU) module for dialogue systems is the extraction of flat concepts out of a given word sequence, usually provided by an automatic speech recognition (ASR) system. In this paper, six different modeling approaches are investigated to tackle the task of concept tagging. These methods include classical, well-known generative and discriminative methods like Finite State Transducers (FSTs), Statistical Machine Translation (SMT), Maximum Entropy Markov Models (MEMMs), or Support Vector Machines (SVMs) as well as techniques recently applied to natural language processing such as Conditional Random Fields (CRFs) or Dynamic Bayesian Networks (DBNs). Following a detailed description of the models, experimental and comparative results are presented on three corpora in different languages and with different complexity. The French MEDIA corpus has already been exploited during an evaluation campaign and so a direct comparison with ...

Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages

S. Hahn;Dinarelli, Marco;C. Raymond;F. Lefevre;P. Lehnen;R. D. Mori;Moschitti, Alessandro;H. Ney;Riccardi, Giuseppe

2011-01-01

Abstract

One of the first steps in building a spoken language understanding (SLU) module for dialogue systems is the extraction of flat concepts out of a given word sequence, usually provided by an automatic speech recognition (ASR) system. In this paper, six different modeling approaches are investigated to tackle the task of concept tagging. These methods include classical, well-known generative and discriminative methods like Finite State Transducers (FSTs), Statistical Machine Translation (SMT), Maximum Entropy Markov Models (MEMMs), or Support Vector Machines (SVMs) as well as techniques recently applied to natural language processing such as Conditional Random Fields (CRFs) or Dynamic Bayesian Networks (DBNs). Following a detailed description of the models, experimental and comparative results are presented on three corpora in different languages and with different complexity. The French MEDIA corpus has already been exploited during an evaluation campaign and so a direct comparison with ...

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2011
			
	Titolo del periodico (Journal title)
	
				IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING
			
	Numero e parte del fascicolo (Issue number and part)
	
				6
			
	DOI
	
				https://dx.doi.org/10.1109/TASL.2010.2093520
			
	Codice Scopus (Scopus identifier)
	
				2-s2.0-79957695367
			
	Codice WOS (WOS identifier)
	
				WOS:000293702300010
			
	Tutti gli autori
	
						S., Hahn; Dinarelli, Marco; C., Raymond; F., Lefevre; P., Lehnen; R. D., Mori; Moschitti, Alessandro; H., Ney; Riccardi, Giuseppe
					
	Appare nelle tipologie:
	
				03.1 Articolo su rivista (Journal article)

File in questo prodotto:

File	Dimensione	Formato
IEEETSLP10-MultSLU.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 865.87 kB Formato Adobe PDF Visualizza/Apri	865.87 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/85113

Citazioni

ND

82

58

102

social impact