Contrastive explanations, which indicate why an AI system produced one output (the target) instead of another (the foil), are widely recognized in explainable AI as more informative and interpretable than standard explanations. However, obtaining such explanations for speech-to-text (S2T) generative models remains an open challenge. Adopting a feature attribution framework, we propose the first method to obtain contrastive explanations in S2T by analyzing how specific regions of the input spectrogram influence the choice between alternative outputs. Through a case study on gender translation in speech translation, we show that our method accurately identifies the audio features that drive the selection of one gender over another.

The Unheard Alternative: Contrastive Explanations for Speech-to-Text Models / Conti, Lina; Fucci, Dennis; Gaido, Marco; Negri, Matteo; Wisniewski, Guillaume; Bentivogli, Luisa. - ELETTRONICO. - (2025), pp. 398-414. ( BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP Suzhou 9th November 2025) [10.18653/v1/2025.blackboxnlp-1.23].

The Unheard Alternative: Contrastive Explanations for Speech-to-Text Models

Lina Conti
Primo
;
Dennis Fucci
Secondo
;
Marco Gaido;
2025-01-01

Abstract

Contrastive explanations, which indicate why an AI system produced one output (the target) instead of another (the foil), are widely recognized in explainable AI as more informative and interpretable than standard explanations. However, obtaining such explanations for speech-to-text (S2T) generative models remains an open challenge. Adopting a feature attribution framework, we propose the first method to obtain contrastive explanations in S2T by analyzing how specific regions of the input spectrogram influence the choice between alternative outputs. Through a case study on gender translation in speech translation, we show that our method accurately identifies the audio features that drive the selection of one gender over another.
2025
Proceedings of the 8th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP
Suzhou
Association for Computational Linguistics
Conti, Lina; Fucci, Dennis; Gaido, Marco; Negri, Matteo; Wisniewski, Guillaume; Bentivogli, Luisa
The Unheard Alternative: Contrastive Explanations for Speech-to-Text Models / Conti, Lina; Fucci, Dennis; Gaido, Marco; Negri, Matteo; Wisniewski, Guillaume; Bentivogli, Luisa. - ELETTRONICO. - (2025), pp. 398-414. ( BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP Suzhou 9th November 2025) [10.18653/v1/2025.blackboxnlp-1.23].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/467675
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact