Translating from languages without productive grammatical gender like English into gender-marked languages is a well-known difficulty for machines. This difficulty is also due to the fact that the training data on which models are built typically reflect the asymmetries of natural languages, gender bias included. Exclusively fed with textual data, machine translation is intrinsically constrained by the fact that the input sentence does not always contain clues about the gender identity of the referred human entities. But what happens with speech translation, where the input is an audio signal? Can audio provide additional information to reduce gender bias? We present the first thorough investigation of gender bias in speech translation, contributing with: i) the release of a benchmark useful for future studies, and ii) the comparison of different technologies (cascade and end-to-end) on two language directions (English-Italian/French).

Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus / Bentivogli, Luisa; Savoldi, Beatrice; Negri, Matteo; Di Gangi Mattia, A.; Cattoni, Roldano; Turchi, Marco. - ELETTRONICO. - (2020), pp. 6923-6933. (Intervento presentato al convegno ACL tenutosi a Seattle - Online nel July) [10.18653/v1/2020.acl-main.619].

Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus

Savoldi Beatrice;Di Gangi Mattia A.;
2020-01-01

Abstract

Translating from languages without productive grammatical gender like English into gender-marked languages is a well-known difficulty for machines. This difficulty is also due to the fact that the training data on which models are built typically reflect the asymmetries of natural languages, gender bias included. Exclusively fed with textual data, machine translation is intrinsically constrained by the fact that the input sentence does not always contain clues about the gender identity of the referred human entities. But what happens with speech translation, where the input is an audio signal? Can audio provide additional information to reduce gender bias? We present the first thorough investigation of gender bias in speech translation, contributing with: i) the release of a benchmark useful for future studies, and ii) the comparison of different technologies (cascade and end-to-end) on two language directions (English-Italian/French).
2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Online
Association for Computational Linguistics
Bentivogli, Luisa; Savoldi, Beatrice; Negri, Matteo; Di Gangi Mattia, A.; Cattoni, Roldano; Turchi, Marco
Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus / Bentivogli, Luisa; Savoldi, Beatrice; Negri, Matteo; Di Gangi Mattia, A.; Cattoni, Roldano; Turchi, Marco. - ELETTRONICO. - (2020), pp. 6923-6933. (Intervento presentato al convegno ACL tenutosi a Seattle - Online nel July) [10.18653/v1/2020.acl-main.619].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/335506
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact