Instance-Based On-line Language Model Adaptation

Bayer, Ali Orkan; Riccardi, Giuseppe

doi:10.21437/interspeech.2013-618

Language model (LM) adaptation is needed to improve the performance of language-based interaction systems. There are two important issues regarding LM adaptation; the selection of the target data set and the mathematical adaptation model. In the literature, usually statistics are drawn from the target data set (e.g. cache model) to augment (e.g. linearly) background statistical language models, as in the case of automatic speech recognition (ASR). Such models are relatively inexpensive to train, however they do not provide the necessary high-dimensional language context description needed for language-based interaction. Instance-based learning provides high-dimensional description of the lexical, semantic, or dialog context. In this paper, we present an instance-based approach to LM adaptation. We show that by retrieving similar instances from the training data and adapting the model with these instances, we can improve the performance of LMs. We propose two different similarity metrics for instance retrieval, edit distance and n-gram match score. We have performed instance-based adaptation on feed forward neural network LMs (NNLMs) to re-score n-best lists for ASR on the LUNA corpus, which includes conver sational speech. We have achieved signiﬁcant improvements in word error rate (WER) by using instance-based on-line LM adaptation on feed forward NNLMs.

Instance-Based On-line Language Model Adaptation / Bayer, A.O., Riccardi, G.. - STAMPA. - (2013), pp. 2688-2692. (14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 Lyon France 25-29 August 2013) [10.21437/interspeech.2013-618].

Instance-Based On-line Language Model Adaptation

Bayer, Ali Orkan;Riccardi, Giuseppe

2013-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2013
			
	Titolo del volume (Proceedings title)
	
				Interspeech 2013
			
	Luogo di edizione (Place of publication)
	
				France
			
	Casa editrice (Publisher)
	
				International Speech Communication Association
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-84906274012
			
	Tutti gli autori
	
						Bayer, Ali Orkan; Riccardi, Giuseppe
					
	Citazione
	
				Instance-Based On-line Language Model Adaptation / Bayer, A.O., Riccardi, G.. - STAMPA. - (2013), pp. 2688-2692. (14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 Lyon France 25-29 August 2013) [10.21437/interspeech.2013-618].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
bayer_is13.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 199.76 kB Formato Adobe PDF Visualizza/Apri	199.76 kB	Adobe PDF	Visualizza/Apri