An incremental turn-taking model for task-oriented dialog systems

Coman, A. C.; Yoshino, K.; Murase, Y.; Nakamura, S.; Riccardi, G.

doi:10.21437/Interspeech.2019-1826

In a human-machine dialog scenario, deciding the appropriate time for the machine to take the turn is an open research problem. In contrast, humans engaged in conversations are able to timely decide when to interrupt the speaker for competitive or non-competitive reasons. In state-of-the-art turn-by-turn dialog systems the decision on the next dialog action is taken at the end of the utterance. In this paper, we propose a token-by-token prediction of the dialog state from incremental transcriptions of the user utterance. To identify the point of maximal understanding in an ongoing utterance, we a) implement an incremental Dialog State Tracker which is updated on a token basis (iDST) b) re-label the Dialog State Tracking Challenge 2 (DSTC2) dataset and c) adapt it to the incremental turn-taking experimental scenario. The re-labeling consists of assigning a binary value to each token in the user utterance that allows to identify the appropriate point for taking the turn. Finally, we impl...

In a human-machine dialog scenario, deciding the appropriate time for the machine to take the turn is an open research problem. In contrast, humans engaged in conversations are able to timely decide when to interrupt the speaker for competitive or non-competitive reasons. In state-of-the-art turn-by-turn dialog systems the decision on the next dialog action is taken at the end of the utterance. In this paper, we propose a token-by-token prediction of the dialog state from incremental transcriptions of the user utterance. To identify the point of maximal understanding in an ongoing utterance, we a) implement an incremental Dialog State Tracker which is updated on a token basis (iDST) b) re-label the Dialog State Tracking Challenge 2 (DSTC2) dataset and c) adapt it to the incremental turn-taking experimental scenario. The re-labeling consists of assigning a binary value to each token in the user utterance that allows to identify the appropriate point for taking the turn. Finally, we implement an incremental Turn Taking Decider (iTTD) that is trained on these new labels for the turn-taking decision. We show that the proposed model can achieve a better performance compared to a deterministic handcrafted turn-taking algorithm.

An incremental turn-taking model for task-oriented dialog systems / Coman, A. C.; Yoshino, K.; Murase, Y.; Nakamura, S.; Riccardi, G.. - 2019-:(2019), pp. 4155-4159. ( 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 Graz 15th-19th September 2019) [10.21437/Interspeech.2019-1826].

An incremental turn-taking model for task-oriented dialog systems

Coman A. C.;Yoshino K.;Murase Y.;Nakamura S.;Riccardi G.

2019-01-01

Abstract

In a human-machine dialog scenario, deciding the appropriate time for the machine to take the turn is an open research problem. In contrast, humans engaged in conversations are able to timely decide when to interrupt the speaker for competitive or non-competitive reasons. In state-of-the-art turn-by-turn dialog systems the decision on the next dialog action is taken at the end of the utterance. In this paper, we propose a token-by-token prediction of the dialog state from incremental transcriptions of the user utterance. To identify the point of maximal understanding in an ongoing utterance, we a) implement an incremental Dialog State Tracker which is updated on a token basis (iDST) b) re-label the Dialog State Tracking Challenge 2 (DSTC2) dataset and c) adapt it to the incremental turn-taking experimental scenario. The re-labeling consists of assigning a binary value to each token in the user utterance that allows to identify the appropriate point for taking the turn. Finally, we impl...

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2019
			
	Titolo del volume (Proceedings title)
	
				Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019
			
	Luogo di edizione (Place of publication)
	
				Baixas
			
	Casa editrice (Publisher)
	
				International Speech Communication Association
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85074716237
			
	Codice WOS (WOS identifier)
	
				WOS:000831796404060
			
	Tutti gli autori
	
						Coman, A. C.; Yoshino, K.; Murase, Y.; Nakamura, S.; Riccardi, G.
					
	Citazione
	
				An incremental turn-taking model for task-oriented dialog systems / Coman, A. C.; Yoshino, K.; Murase, Y.; Nakamura, S.; Riccardi, G.. - 2019-:(2019), pp. 4155-4159. ( 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 Graz 15th-19th September 2019) [10.21437/Interspeech.2019-1826].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
1826.pdf accesso aperto Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 289.51 kB Formato Adobe PDF Visualizza/Apri	289.51 kB	Adobe PDF	Visualizza/Apri