A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models

IRIS

The adoption of deep learning models has brought significant performance improvements across several research fields, such as computer vision and natural language processing. However, their 'black-box' nature yields the downside of poor explainability: in particular, several real-world applications require - to varying extents - reliable confidence scores associated to a model's prediction. The relation between a model's accuracy and confidence is typically referred to as calibration. In this work, we propose a novel calibration method based on gradient accumulation in conjunction with existing loss regularization techniques. Our experiments on the Named Entity Recognition task show an improvement of the performance/calibration ratio compared to the current methods.

A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models / Jouet, G.; Duhart, C.; Staiano, J.; Rousseaux, F.; De Runz, C.. - 2022-:(2022), pp. 01-08. (Intervento presentato al convegno 2022 International Joint Conference on Neural Networks, IJCNN 2022 tenutosi a Padova nel 2022) [10.1109/IJCNN55064.2022.9892324].

A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models

Jouet G.;Duhart C.;Staiano J.;Rousseaux F.;De Runz C.

2022-01-01

Abstract

The adoption of deep learning models has brought significant performance improvements across several research fields, such as computer vision and natural language processing. However, their 'black-box' nature yields the downside of poor explainability: in particular, several real-world applications require - to varying extents - reliable confidence scores associated to a model's prediction. The relation between a model's accuracy and confidence is typically referred to as calibration. In this work, we propose a novel calibration method based on gradient accumulation in conjunction with existing loss regularization techniques. Our experiments on the Named Entity Recognition task show an improvement of the performance/calibration ratio compared to the current methods.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2022
			
	Titolo del volume (Proceedings title)
	
				Proceedings of the International Joint Conference on Neural Networks
			
	Luogo di edizione (Place of publication)
	
				New York
			
	Casa editrice (Publisher)
	
				Institute of Electrical and Electronics Engineers Inc.
			
	ISBN
	
				978-1-7281-8671-9
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85140715410
			
	Codice WOS (WOS identifier)
	
				WOS:000867070903084
			
	Tutti gli autori
	
						Jouet, G.; Duhart, C.; Staiano, J.; Rousseaux, F.; De Runz, C.
					
	Citazione
	
				A Novel Gradient Accumulation Method for Calibration of Named Entity Recognition Models / Jouet, G.; Duhart, C.; Staiano, J.; Rousseaux, F.; De Runz, C.. - 2022-:(2022), pp. 01-08. (Intervento presentato al  convegno 2022 International Joint Conference on Neural Networks, IJCNN 2022 tenutosi a Padova nel 2022) [10.1109/IJCNN55064.2022.9892324].

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/362928

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

0

0

ND

social impact