Dynamic word recommendation to obtain diverse crowdsourced paraphrases of user utterances

Yaghoub-Zadeh-Fard, Mohammad-Ali; Benatallah, Boualem; Casati, Fabio; Barukh, Moshe Chai; Zamanirad, Shayan

2020-01-01

Abstract

Building task-oriented bots requires mapping a user utterance to an intent and its associated entities in order to serve the request. This is difficult because it requires large quantities of high-quality, diverse training data to learn how to map all possible variations of utterances that share the same intent. Crowdsourcing may be an effective, inexpensive, and scalable technique for collecting such large datasets. However, the diversity of the results suffers from the priming effect (i.e., workers are more likely to reuse the words in the sentence they are asked to paraphrase). In this paper, we treat priming as an opportunity rather than a threat: we dynamically generate word suggestions that encourage crowd workers to produce diverse utterances. The key challenge is to make suggestions that improve diversity without leading to semantically invalid paraphrases. To achieve this, we propose a probabilistic model that continuously refines its word suggestions to balance diversity and semantic relevance. Our experiments show that the proposed approach improves the diversity of crowdsourced paraphrases.

Date of publication: 2020
Proceedings title: Proceedings of the 25th International Conference on Intelligent User Interfaces
Place of publication: New York, NY, United States
Publisher: Association for Computing Machinery (ACM)
ISBN: 9781450371186
Scopus identifier: 2-s2.0-85082447880
WOS identifier: WOS:001062821300010
All authors: Yaghoub-Zadeh-Fard, Mohammad-Ali; Benatallah, Boualem; Casati, Fabio; Barukh, Moshe Chai; Zamanirad, Shayan
Citation: Dynamic word recommendation to obtain diverse crowdsourced paraphrases of user utterances / Yaghoub-Zadeh-Fard, Mohammad-Ali; Benatallah, Boualem; Casati, Fabio; Barukh, Moshe Chai; Zamanirad, Shayan. - (2020), pp. 55-66. (25th ACM International Conference on Intelligent User Interfaces, IUI 2020, Cagliari, 17-20 March 2020) [10.1145/3377325.3377486].
Appears in types: 04.1 Paper in Proceedings

Files in this item:

File	Size	Format
3377325.3377486-2.pdf (open access; type: publisher's layout; license: all rights reserved)	744.07 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/397752
