The adoption of deep learning models has brought significant performance improvements across several research fields, such as computer vision and natural language processing. However, their 'black-box' nature yields the downside of poor explainability: in particular, several real-world applications require - to varying extents - reliable confidence scores associated to a model's prediction. The relation between a model's accuracy and confidence is typically referred to as calibration. In this work, we propose a novel calibration method based on gradient accumulation in conjunction with existing loss regularization techniques. Our experiments on the Named Entity Recognition task show an improvement of the performance/calibration ratio compared to the current methods.
A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models / Jouet, Gregor; Duhart, Clement; Staiano, Jacopo; Rousseaux, Francis; De Runz, Cyril. - (2022), pp. 01-08. ( IJCNN 2022 Padova 18th-23th July 2022) [10.1109/IJCNN55064.2022.9892324].
A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models
Staiano, Jacopo;
2022-01-01
Abstract
The adoption of deep learning models has brought significant performance improvements across several research fields, such as computer vision and natural language processing. However, their 'black-box' nature yields the downside of poor explainability: in particular, several real-world applications require - to varying extents - reliable confidence scores associated to a model's prediction. The relation between a model's accuracy and confidence is typically referred to as calibration. In this work, we propose a novel calibration method based on gradient accumulation in conjunction with existing loss regularization techniques. Our experiments on the Named Entity Recognition task show an improvement of the performance/calibration ratio compared to the current methods.| File | Dimensione | Formato | |
|---|---|---|---|
|
A_Novel_Gradient_Accumulation_Method_for_Calibration_of_Named_Entity_Recognition_Models.pdf
Solo gestori archivio
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
934.99 kB
Formato
Adobe PDF
|
934.99 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione



