The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues

Testoni, Alberto; Bernardi, Raffaella

doi:10.18653/v1/2021.eacl-main.178

When training a model on referential dialogue guessing games, the best model is usually chosen based on its task success. We show that in the popular end-to-end approach, this choice prevents the model from learning to generate linguistically richer dialogues, since the acquisition of language proficiency takes longer than learning the guessing task. By comparing models playing different games (GuessWhat, GuessWhich, and Mutual Friends), we show that this discrepancy is model- and task-agnostic. We investigate whether and when better language quality could lead to higher task success. We show that in GuessWhat, models could increase their accuracy if they learn to ground, encode, and decode also words that do not occur frequently in the training set.

The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues / Testoni, Alberto; Bernardi, Raffaella. - ELETTRONICO. - (2021), pp. 2071-2082. ( 16th Conference of the European Chapter of the Associationfor Computational Linguistics, EACL 2021 Online 19-23 April 2021) [10.18653/v1/2021.eacl-main.178].

The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues

Testoni, Alberto;Bernardi, Raffaella

2021-01-01

Abstract

When training a model on referential dialogue guessing games, the best model is usually chosen based on its task success. We show that in the popular end-to-end approach, this choice prevents the model from learning to generate linguistically richer dialogues, since the acquisition of language proficiency takes longer than learning the guessing task. By comparing models playing different games (GuessWhat, GuessWhich, and Mutual Friends), we show that this discrepancy is model- and task-agnostic. We investigate whether and when better language quality could lead to higher task success. We show that in GuessWhat, models could increase their accuracy if they learn to ground, encode, and decode also words that do not occur frequently in the training set.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2021
			
	Titolo del volume (Proceedings title)
	
				Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics
			
	Luogo di edizione (Place of publication)
	
				209 N EIGHTH STREET, STROUDSBURG, PA 18360 USA
			
	Casa editrice (Publisher)
	
				ACL
			
	ISBN
	
				9781954085022
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85107285196
			
	Codice WOS (WOS identifier)
	
				WOS:000863557002014
			
	Tutti gli autori
	
						Testoni, Alberto; Bernardi, Raffaella
					
	Citazione
	
				The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues / Testoni, Alberto; Bernardi, Raffaella. - ELETTRONICO. - (2021), pp. 2071-2082. ( 16th Conference of the European Chapter of the Associationfor Computational Linguistics, EACL 2021 Online 19-23 April 2021) [10.18653/v1/2021.eacl-main.178].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
2021.eacl-main.178.pdf accesso aperto Descrizione: articolo principale Tipologia: Versione editoriale (Publisher’s layout) Licenza: Creative commons Dimensione 2.29 MB Formato Adobe PDF Visualizza/Apri	2.29 MB	Adobe PDF	Visualizza/Apri