To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs / Scialom, Thomas; Dray, Paul-Alexis; Lamprier, Sylvain; Piwowarski, Benjamin; Staiano, Jacopo. - 32:(2021), pp. 26585-26597. (Paper presented at the 35th Conference on Neural Information Processing Systems, NeurIPS 2021, held virtually online, 6th-14th December 2021).

To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs

Staiano, Jacopo
2021-01-01

Abstract

Because words are discrete, language GANs must be optimized with reinforcement learning methods, using rewards provided by discriminator networks. This setting is much harder than continuous tasks, where gradients flow directly from the discriminator to the generator, and it usually leads to dramatic learning instabilities. We claim that this problem can be solved by making the discriminator and generator networks cooperate to produce output sequences during training. These cooperative outputs, built by design to obtain higher discrimination scores, not only provide denser rewards for training but also form a more compact artificial set for discriminator training, which improves the discriminator's accuracy and stability. In this paper, we show that our SelfGAN framework, built on this cooperative principle, outperforms Teacher Forcing and obtains state-of-the-art results on two challenging tasks, Summarization and Question Generation.
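The cooperative principle described in the abstract can be illustrated with a small sketch. This is not the authors' SelfGAN implementation: the function names (cooperative_beam_search, toy_generator, toy_discriminator), the three-token vocabulary, and the scoring rule are all hypothetical, chosen only to show a generator proposing tokens while a discriminator decides which partial sequences survive in the beam.

```python
import math
from typing import Callable, Dict, List, Tuple

Seq = Tuple[str, ...]

# Hypothetical "human" reference used only by the toy discriminator below.
REFERENCE: Seq = ("the", "cat", "</s>")


def toy_generator(prefix: Seq) -> Dict[str, float]:
    """Toy generator: the same next-token log-probabilities for every prefix."""
    return {"the": math.log(0.5), "cat": math.log(0.3), "</s>": math.log(0.2)}


def toy_discriminator(seq: Seq) -> float:
    """Toy discriminator: +1 per token matching the reference, -1 otherwise."""
    return float(sum(
        1 if i < len(REFERENCE) and tok == REFERENCE[i] else -1
        for i, tok in enumerate(seq)
    ))


def cooperative_beam_search(
    next_log_probs: Callable[[Seq], Dict[str, float]],
    disc_score: Callable[[Seq], float],
    beam_size: int = 2,
    top_k: int = 3,
    max_len: int = 3,
    eos: str = "</s>",
) -> Seq:
    """Beam search in which the generator proposes and the discriminator ranks."""
    beams: List[Seq] = [()]
    for _ in range(max_len):
        candidates: List[Seq] = []
        for seq in beams:
            if seq and seq[-1] == eos:
                candidates.append(seq)  # finished hypotheses are carried over
                continue
            log_probs = next_log_probs(seq)
            # The generator proposes its top_k most likely next tokens.
            proposals = sorted(log_probs, key=log_probs.get, reverse=True)[:top_k]
            candidates.extend(seq + (tok,) for tok in proposals)
        # The discriminator keeps the hypotheses it scores highest.
        candidates.sort(key=disc_score, reverse=True)
        beams = candidates[:beam_size]
    return beams[0]


def greedy_decode(
    next_log_probs: Callable[[Seq], Dict[str, float]], max_len: int = 3
) -> Seq:
    """Generator-only baseline: always take the most likely next token."""
    seq: Seq = ()
    for _ in range(max_len):
        log_probs = next_log_probs(seq)
        seq += (max(log_probs, key=log_probs.get),)
    return seq


cooperative = cooperative_beam_search(toy_generator, toy_discriminator)
greedy = greedy_decode(toy_generator)
```

In this toy setting the cooperative search returns a sequence that the discriminator scores higher than the generator-only greedy output. Under the assumptions above, such sequences would then serve as the denser-reward training samples the abstract refers to.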
Year: 2021
Published in: Advances in Neural Information Processing Systems 34
Place of publication: San Mateo, CA
Publisher: Neural Information Processing Systems Foundation
ISBN: 9781713845393
Authors: Scialom, Thomas; Dray, Paul-Alexis; Lamprier, Sylvain; Piwowarski, Benjamin; Staiano, Jacopo
Files in this item:
File: NeurIPS-2021-to-beam-or-not-to-beam-that-is-a-question-of-cooperation-for-language-gans-Paper.pdf
Access: Archive managers only
Type: Publisher's version (Publisher's layout)
License: All rights reserved
Size: 511.93 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/362926