Recent advancements in deep learning have paved the way for novel approaches to the problem of Packet Loss Concealment (PLC) in networked music performance systems. However, deep neural networks may have large inference times and, therefore, violate the strict temporal requirements of PLC methods for such systems. A promising avenue in this space lies in the exploration of the loss function used to train the network. Indeed, loss functions have a direct impact on the latent representation learned by the model during the training process without any additional cost at inference time. In this paper, we present the Tilt Loss, a perceptual loss function, i.e., a loss function that allows the model trained with it to have performances that correlate with the human evaluation. The proposed method was able to outperform the current state-of-the-art in PLC methods according to human evaluation, albeit the model exhibited unsatisfactory performance with unpitched instruments. Furthermore, our study pinpoints the need for novel objective metrics specifically tailored for the PLC case.

Tilt loss: a perceptual loss function to improve music packet loss concealment / Daniotti, F., Vignati, L., Turchet, L.. - In: EURASIP JOURNAL ON AUDIO, SPEECH, AND MUSIC PROCESSING. - ISSN 1687-4714. - 2026:1(2026). [10.1186/S13636-025-00442-1]

Tilt loss: a perceptual loss function to improve music packet loss concealment

Daniotti, Filippo;Vignati, Luca;Turchet, Luca
2026-01-01

Abstract

Recent advancements in deep learning have paved the way for novel approaches to the problem of Packet Loss Concealment (PLC) in networked music performance systems. However, deep neural networks may have large inference times and, therefore, violate the strict temporal requirements of PLC methods for such systems. A promising avenue in this space lies in the exploration of the loss function used to train the network. Indeed, loss functions have a direct impact on the latent representation learned by the model during the training process without any additional cost at inference time. In this paper, we present the Tilt Loss, a perceptual loss function, i.e., a loss function that allows the model trained with it to have performances that correlate with the human evaluation. The proposed method was able to outperform the current state-of-the-art in PLC methods according to human evaluation, albeit the model exhibited unsatisfactory performance with unpitched instruments. Furthermore, our study pinpoints the need for novel objective metrics specifically tailored for the PLC case.
2026
1
Daniotti, Filippo; Vignati, Luca; Turchet, Luca
Tilt loss: a perceptual loss function to improve music packet loss concealment / Daniotti, F., Vignati, L., Turchet, L.. - In: EURASIP JOURNAL ON AUDIO, SPEECH, AND MUSIC PROCESSING. - ISSN 1687-4714. - 2026:1(2026). [10.1186/S13636-025-00442-1]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/488032
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact