Visualization: The missing factor in simultaneous speech translation

IRIS

Simultaneous speech translation (SimulST) is the task in which output generation has to be performed on partial, incremental speech input. In recent years, SimulST has become popular due to the spread of multilingual application scenarios, such as international live conferences and streaming lectures, in which on-the-fly speech translation can facilitate users' access to audio-visual content. In this paper, we analyze the characteristics of the SimulST systems developed so far, discussing their strengths and weaknesses. We then concentrate on the evaluation framework required to properly assess systems' effectiveness. To this end, we raise the need for a broader performance analysis that also includes the user experience standpoint. We argue that SimulST systems should be evaluated not only in terms of quality/latency measures, but also via task-oriented metrics that account, for instance, for the visualization strategy adopted. In light of this, we highlight the goals the community has achieved and what is still missing.

Visualization: The missing factor in simultaneous speech translation / Papi, Sara; Negri, Matteo; Turchi, Marco. - 3033:(2021). (8th Italian Conference on Computational Linguistics, CLiC-it 2021, Università degli Studi di Milano-Bicocca, ita, June 29 - July 1 2022).


Sara Papi (first author); Matteo Negri (second author); Marco Turchi (last author)

2021-01-01



Date of publication	2021
Proceedings title	Italian Conference on Computational Linguistics 2021 CEUR Workshop Proceedings
Place of publication	Aachen, Germany
Publisher	CEUR-WS
Scopus Identifier	2-s2.0-85121212741
All authors	Papi, Sara; Negri, Matteo; Turchi, Marco
					
Appears in types	04.1 Paper in Proceedings

Files in this item:

File	Size	Format
paper22.pdf (open access; type: refereed author's manuscript, post-print; license: Creative Commons)	278.79 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/369988
