The adoption of deep learning models has brought significant performance improvements across several research fields, such as computer vision and natural language processing. However, their 'black-box' nature yields the downside of poor explainability: in particular, several real-world applications require - to varying extents - reliable confidence scores associated to a model's prediction. The relation between a model's accuracy and confidence is typically referred to as calibration. In this work, we propose a novel calibration method based on gradient accumulation in conjunction with existing loss regularization techniques. Our experiments on the Named Entity Recognition task show an improvement of the performance/calibration ratio compared to the current methods.

A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models / Jouet, Gregor; Duhart, Clement; Staiano, Jacopo; Rousseaux, Francis; De Runz, Cyril. - (2022), pp. 01-08. ( IJCNN 2022 Padova 18th-23th July 2022) [10.1109/IJCNN55064.2022.9892324].

A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models

Staiano, Jacopo;
2022-01-01

Abstract

The adoption of deep learning models has brought significant performance improvements across several research fields, such as computer vision and natural language processing. However, their 'black-box' nature yields the downside of poor explainability: in particular, several real-world applications require - to varying extents - reliable confidence scores associated to a model's prediction. The relation between a model's accuracy and confidence is typically referred to as calibration. In this work, we propose a novel calibration method based on gradient accumulation in conjunction with existing loss regularization techniques. Our experiments on the Named Entity Recognition task show an improvement of the performance/calibration ratio compared to the current methods.
2022
2022 International Joint Conference on Neural Networks Proceedings
Piscataway, NJ
Institute of Electrical and Electronics Engineers Inc.
978-1-7281-8671-9
Jouet, Gregor; Duhart, Clement; Staiano, Jacopo; Rousseaux, Francis; De Runz, Cyril
A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models / Jouet, Gregor; Duhart, Clement; Staiano, Jacopo; Rousseaux, Francis; De Runz, Cyril. - (2022), pp. 01-08. ( IJCNN 2022 Padova 18th-23th July 2022) [10.1109/IJCNN55064.2022.9892324].
File in questo prodotto:
File Dimensione Formato  
A_Novel_Gradient_Accumulation_Method_for_Calibration_of_Named_Entity_Recognition_Models.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 934.99 kB
Formato Adobe PDF
934.99 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/362928
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex 0
social impact