Towards Reliable Hybrid Human-Machine Classifiers

Sayin Günel, Burcu

doi:10.15168/11572_349843

In this thesis, we focus on building reliable hybrid human-machine classifiers for deployment in cost-sensitive classification tasks. The objective is to assess ML quality in hybrid classification contexts and to design appropriate metrics, so that we know whether we can trust the model's predictions and can identify the subset of items on which the model is well calibrated and trustworthy. We start by discussing the key concepts, research questions, challenges, and architecture needed to design and implement an effective hybrid classification service. We then investigate each service component in more depth and present our solutions and results. Our main contributions are to cost-sensitive hybrid classification, selective classification, model calibration, and active learning. We highlight the importance of model calibration in hybrid classification services and propose novel approaches to improve the calibration of human-machine classifiers. In addition, we argue that current accuracy-based metrics are misaligned with the actual value of machine learning models and propose a novel metric, "value". We further test state-of-the-art (SOTA) machine learning models on NLP tasks in a cost-sensitive hybrid classification context, and show that their performance drops significantly when they are evaluated according to value rather than accuracy. Finally, we investigate the quality of hybrid classifiers in active learning scenarios: we review existing active learning strategies, evaluate their effectiveness, and propose a novel value-aware active learning strategy to improve the performance of selective classifiers in the active learning of cost-sensitive tasks.

Towards Reliable Hybrid Human-Machine Classifiers / Sayin Günel, Burcu. - (2022 Sep 26), pp. 1-165. [10.15168/11572_349843]

2022-09-26

Defended on: 26 Sep 2022
Cycle: XXXIV
Academic year: 2021-2022
Department: Ingegneria e scienza dell'Informaz (29/10/12-)
Doctoral programme: Information and Communication Technology
Unitn internal supervisor: Casati, Fabio
Unitn co-supervisor: Passerini, Andrea
Bi-nationally supervised doctoral thesis: no
DOI: https://dx.doi.org/10.15168/11572_349843
Language: English
Reference SSD (valid until 24/06/2024): Settore INF/01 - Informatica
Appears in types: 08.1 Tesi di dottorato (Doctoral Thesis)

Files in this item:

File	Size	Format
Burcu_PhD_Thesis_September_2022.pdf (Open Access from 27/09/2023; Type: Doctoral Thesis; License: Creative Commons)	10.18 MB	Adobe PDF