To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs

Scialom, Thomas; Dray, Paul-Alexis; Lamprier, Sylvain; Piwowarski, Benjamin; Staiano, Jacopo

Due to the discrete nature of words, language GANs require to be optimized from rewards provided by discriminator networks, via reinforcement learning methods. This is a much harder setting than for continuous tasks, which enjoy gradient flows from discriminators to generators, usually leading to dramatic learning instabilities. However, we claim that this can be solved by making discriminator and generator networks cooperate to produce output sequences during training. These cooperative outputs, inherently built to obtain higher discrimination scores, not only provide denser rewards for training, but also form a more compact artificial set for discriminator training, hence improving its accuracy and stability. In this paper, we show that our SelfGAN framework, built on this cooperative principle, outperforms Teacher Forcing and obtains state-of-the-art results on two challenging tasks, Summarization and Question Generation.

To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs / Scialom, Thomas; Dray, Paul-Alexis; Lamprier, Sylvain; Piwowarski, Benjamin; Staiano, Jacopo. - 32:(2021), pp. 26585-26597. (Intervento presentato al convegno 35th Conference on Neural Information Processing Systems, NeurIPS 2021 tenutosi a Virtual, Online nel 6th-14th December 2021).

To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs

Scialom, Thomas;Dray, Paul-Alexis;Lamprier, Sylvain;Piwowarski, Benjamin;Staiano, Jacopo

2021-01-01

Abstract

Due to the discrete nature of words, language GANs require to be optimized from rewards provided by discriminator networks, via reinforcement learning methods. This is a much harder setting than for continuous tasks, which enjoy gradient flows from discriminators to generators, usually leading to dramatic learning instabilities. However, we claim that this can be solved by making discriminator and generator networks cooperate to produce output sequences during training. These cooperative outputs, inherently built to obtain higher discrimination scores, not only provide denser rewards for training, but also form a more compact artificial set for discriminator training, hence improving its accuracy and stability. In this paper, we show that our SelfGAN framework, built on this cooperative principle, outperforms Teacher Forcing and obtains state-of-the-art results on two challenging tasks, Summarization and Question Generation.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2021
			
	Titolo del volume (Proceedings title)
	
				Advances in Neural Information Processing Systems 34
			
	Luogo di edizione (Place of publication)
	
				San Mateo, CA
			
	Casa editrice (Publisher)
	
				Neural information processing systems foundation
			
	ISBN
	
				9781713845393
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85124063310
			
	Codice WOS (WOS identifier)
	
				WOS:000925183302046
			
	Tutti gli autori
	
						Scialom, Thomas; Dray, Paul-Alexis; Lamprier, Sylvain; Piwowarski, Benjamin; Staiano, Jacopo
					
	Citazione
	
				To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs / Scialom, Thomas; Dray, Paul-Alexis; Lamprier, Sylvain; Piwowarski, Benjamin; Staiano, Jacopo. - 32:(2021), pp. 26585-26597. (Intervento presentato al  convegno 35th Conference on Neural Information Processing Systems, NeurIPS 2021 tenutosi a Virtual, Online nel 6th-14th December 2021).
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
NeurIPS-2021-to-beam-or-not-to-beam-that-is-a-question-of-cooperation-for-language-gans-Paper.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 511.93 kB Formato Adobe PDF Visualizza/Apri	511.93 kB	Adobe PDF	Visualizza/Apri