The adoption of deep learning models has brought significant performance improvements across several research fields, such as computer vision and natural language processing. However, their 'black-box' nature comes at the cost of poor explainability: in particular, several real-world applications require, to varying extents, reliable confidence scores associated with a model's predictions. The relation between a model's accuracy and its confidence is typically referred to as calibration. In this work, we propose a novel calibration method based on gradient accumulation, used in conjunction with existing loss regularization techniques. Our experiments on the Named Entity Recognition task show an improvement in the performance/calibration ratio compared to current methods.
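
To make the notion of calibration above concrete, the following is a minimal sketch of the expected calibration error (ECE), a commonly used way to quantify the gap between confidence and accuracy. It is an illustration only: the abstract does not state which calibration metric the paper uses, and the function name, the equal-width binning and the default of 10 bins are assumptions made here.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Illustrative ECE with equal-width confidence bins.

    confidences: predicted confidence in [0, 1] for each prediction.
    correct: booleans, True where the prediction was right.
    n_bins: number of bins (an assumption, not taken from the paper).
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        return 0.0

    # Group predictions by confidence; a confidence of 1.0 goes in the last bin.
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, ok))

    # Weighted average of |mean confidence - accuracy| over non-empty bins.
    ece = 0.0
    for items in bins:
        if not items:
            continue
        avg_conf = sum(c for c, _ in items) / len(items)
        accuracy = sum(1 for _, ok in items if ok) / len(items)
        ece += (len(items) / n) * abs(avg_conf - accuracy)
    return ece
```

Under this sketch, a model whose predictions made with 90% confidence are right about 90% of the time scores close to zero, while an overconfident model scores higher.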

A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models / Jouet, G.; Duhart, C.; Staiano, J.; Rousseaux, F.; De Runz, C. - 2022-:(2022), pp. 1-8. (Paper presented at the 2022 International Joint Conference on Neural Networks, IJCNN 2022, held in Padova in 2022) [10.1109/IJCNN55064.2022.9892324].

A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models

Staiano, J.
2022-01-01

Year: 2022
Published in: Proceedings of the International Joint Conference on Neural Networks
Place of publication: New York
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN: 978-1-7281-8671-9
Authors: Jouet, G.; Duhart, C.; Staiano, J.; Rousseaux, F.; De Runz, C.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/362928