Tilt loss: a perceptual loss function to improve music packet loss concealment

IRIS

Recent advancements in deep learning have paved the way for novel approaches to the problem of Packet Loss Concealment (PLC) in networked music performance systems. However, deep neural networks may have large inference times and, therefore, violate the strict temporal requirements of PLC methods for such systems. A promising avenue in this space lies in the exploration of the loss function used to train the network. Indeed, loss functions have a direct impact on the latent representation learned by the model during the training process without any additional cost at inference time. In this paper, we present the Tilt Loss, a perceptual loss function, i.e., a loss function that allows the model trained with it to have performances that correlate with the human evaluation. The proposed method was able to outperform the current state-of-the-art in PLC methods according to human evaluation, albeit the model exhibited unsatisfactory performance with unpitched instruments. Furthermore, our study pinpoints the need for novel objective metrics specifically tailored for the PLC case.

Tilt loss: a perceptual loss function to improve music packet loss concealment / Daniotti, F., Vignati, L., Turchet, L.. - In: EURASIP JOURNAL ON AUDIO, SPEECH, AND MUSIC PROCESSING. - ISSN 1687-4714. - 2026:1(2026). [10.1186/S13636-025-00442-1]

Tilt loss: a perceptual loss function to improve music packet loss concealment

Daniotti, Filippo;Vignati, Luca;Turchet, Luca

2026-01-01

Abstract

Recent advancements in deep learning have paved the way for novel approaches to the problem of Packet Loss Concealment (PLC) in networked music performance systems. However, deep neural networks may have large inference times and, therefore, violate the strict temporal requirements of PLC methods for such systems. A promising avenue in this space lies in the exploration of the loss function used to train the network. Indeed, loss functions have a direct impact on the latent representation learned by the model during the training process without any additional cost at inference time. In this paper, we present the Tilt Loss, a perceptual loss function, i.e., a loss function that allows the model trained with it to have performances that correlate with the human evaluation. The proposed method was able to outperform the current state-of-the-art in PLC methods according to human evaluation, albeit the model exhibited unsatisfactory performance with unpitched instruments. Furthermore, our study pinpoints the need for novel objective metrics specifically tailored for the PLC case.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2026
			
	Titolo del periodico (Journal title)
	
				EURASIP JOURNAL ON AUDIO, SPEECH, AND MUSIC PROCESSING
			
	Numero e parte del fascicolo (Issue number and part)
	
				1
			
	DOI
	
				https://dx.doi.org/10.1186/S13636-025-00442-1
			
	Codice WOS (WOS identifier)
	
				WOS:001718764500001
			
	Tutti gli autori
	
						Daniotti, Filippo; Vignati, Luca; Turchet, Luca
					
	Citazione
	
				Tilt loss: a perceptual loss function to improve music packet loss concealment / Daniotti, F., Vignati, L., Turchet, L.. - In: EURASIP JOURNAL ON AUDIO, SPEECH, AND MUSIC PROCESSING. - ISSN 1687-4714. - 2026:1(2026). [10.1186/S13636-025-00442-1]

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/488032

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

ND

0

ND

social impact