Simultaneous speech translation (SimulST) is the task in which output generation has to be performed on partial, incremental speech input. In recent years, SimulST has become popular due to the spread of multilingual application scenarios, like international live conferences and streaming lectures, in which on-the-fly speech translation can facilitate users' access to audio-visual content. In this paper, we analyze the characteristics of the SimulST systems developed so far, discussing their strengths and weaknesses. We then concentrate on the evaluation framework required to properly assess systems' effectiveness. To this end, we raise the need for a broader performance analysis, also including the user experience standpoint. We argue that SimulST systems, indeed, should be evaluated not only in terms of quality/latency measures, but also via task-oriented metrics accounting, for instance, for the visualization strategy adopted. In light of this, we highlight which are the goals achieved by the community and what is still missing.

Visualization: The missing factor in simultaneous speech translation / Papi, Sara; Negri, Matteo; Turchi, Marco. - 3033:(2021). (Intervento presentato al convegno 8th Italian Conference on Computational Linguistics, CLiC-it 2021 tenutosi a Universita degli Studi di Milano-Bicocca, ita nel June 29 - July 1 2022).

Visualization: The missing factor in simultaneous speech translation

Sara Papi
Primo
;
Marco Turchi
Ultimo
2021-01-01

Abstract

Simultaneous speech translation (SimulST) is the task in which output generation has to be performed on partial, incremental speech input. In recent years, SimulST has become popular due to the spread of multilingual application scenarios, like international live conferences and streaming lectures, in which on-the-fly speech translation can facilitate users' access to audio-visual content. In this paper, we analyze the characteristics of the SimulST systems developed so far, discussing their strengths and weaknesses. We then concentrate on the evaluation framework required to properly assess systems' effectiveness. To this end, we raise the need for a broader performance analysis, also including the user experience standpoint. We argue that SimulST systems, indeed, should be evaluated not only in terms of quality/latency measures, but also via task-oriented metrics accounting, for instance, for the visualization strategy adopted. In light of this, we highlight which are the goals achieved by the community and what is still missing.
2021
Italian Conference on Computational Linguistics 2021 CEUR Workshop Proceedings
Aachen, Germany
CEUR-WS
Papi, Sara; Negri, Matteo; Turchi, Marco
Visualization: The missing factor in simultaneous speech translation / Papi, Sara; Negri, Matteo; Turchi, Marco. - 3033:(2021). (Intervento presentato al convegno 8th Italian Conference on Computational Linguistics, CLiC-it 2021 tenutosi a Universita degli Studi di Milano-Bicocca, ita nel June 29 - July 1 2022).
File in questo prodotto:
File Dimensione Formato  
paper22.pdf

accesso aperto

Tipologia: Post-print referato (Refereed author’s manuscript)
Licenza: Creative commons
Dimensione 278.79 kB
Formato Adobe PDF
278.79 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/369988
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact