How do you feel? Measuring User-Perceived Value for Rejecting Machine Decisions in Hate Speech Detection

Casati, F.; Yang, J.
2023-01-01

Abstract

Hate speech moderation remains a challenging task for social media platforms. Human-AI collaborative systems offer the potential to combine the reliability of human moderators with the scalability of machine learning to tackle this issue effectively. While methods for task handover in human-AI collaboration exist that consider the costs of incorrect predictions, insufficient attention has been paid to accurately estimating these costs. In this work, we propose a value-sensitive rejection mechanism that automatically rejects machine decisions for human moderation based on users' value perceptions of those decisions. We conduct a crowdsourced survey study with 160 participants to evaluate their perception of correct and incorrect machine decisions in the domain of hate speech detection, as well as of cases where the system rejects making a prediction. Here, we introduce Magnitude Estimation, an unbounded scale, as the preferred method for measuring user (dis)agreement with machine decisions. Our results show that Magnitude Estimation provides a reliable measurement of participants' perception of machine decisions. By integrating user-perceived value into human-AI collaboration, we further show that it can guide us in 1) deciding when to accept or reject machine decisions so as to obtain the optimal total value a model can deliver, and 2) selecting better classification models than selection by the more widely used target of model accuracy.
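To make the abstract's idea concrete, below is a minimal sketch (not the paper's actual implementation) of what a value-sensitive rejection rule might look like in Python. Magnitude Estimation scores are unbounded, so they are first rescaled per rater (geometric-mean normalization is a common ME convention, assumed here), and a machine decision is accepted only when its expected user-perceived value beats the value of deferring to a human moderator. All function names, value labels, and numbers below are hypothetical and purely illustrative.

```python
import numpy as np

def normalize_me_scores(scores_by_rater):
    """Rescale each rater's Magnitude Estimation scores.

    ME is unbounded, so raw scores are commonly divided by the rater's
    geometric mean magnitude before aggregation (a standard ME practice,
    assumed here; the paper's exact normalization may differ).
    """
    normalized = {}
    for rater, scores in scores_by_rater.items():
        magnitudes = np.abs(np.array(scores, dtype=float))
        magnitudes = magnitudes[magnitudes > 0]
        gm = np.exp(np.mean(np.log(magnitudes))) if len(magnitudes) else 1.0
        normalized[rater] = [s / gm for s in scores]
    return normalized

def decide(p_hate, v):
    """Accept the machine decision only if its expected user-perceived
    value exceeds the value of rejecting (handing over to a human).

    v maps outcomes to elicited values: v['tp'] for correctly flagging
    hate, v['fp'] for wrongly flagging, v['tn']/v['fn'] analogously for
    keeping content up, and v['reject'] for deferral to a human.
    """
    value_flag = p_hate * v["tp"] + (1 - p_hate) * v["fp"]
    value_keep = (1 - p_hate) * v["tn"] + p_hate * v["fn"]
    best_auto = max(value_flag, value_keep)
    if best_auto >= v["reject"]:
        return "flag" if value_flag >= value_keep else "keep"
    return "reject"  # too costly either way: defer to a human moderator

# Illustrative values only, e.g. users penalize missed hate most heavily.
values = {"tp": 1.0, "tn": 0.6, "fp": -2.5, "fn": -3.0, "reject": -0.2}
print(decide(p_hate=0.55, v=values))  # prints 'reject': too uncertain
```

Because the rejection threshold is derived from elicited value perceptions rather than from raw error rates, two models with identical accuracy can deliver different total value, which is what motivates the paper's value-based model selection.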
Year: 2023
Proceedings: AIES '23: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society
Publisher address: 1601 Broadway, 10th Floor, New York, NY, United States
Publisher: Association for Computing Machinery, Inc
ISBN: 9798400702310
Lammerts, P.; Lippmann, P.; Hsu, Y.-C.; Casati, F.; Yang, J.
How do you feel? Measuring User-Perceived Value for Rejecting Machine Decisions in Hate Speech Detection / Lammerts, P.; Lippmann, P.; Hsu, Y.-C.; Casati, F.; Yang, J. - (2023), pp. 834-844. (Paper presented at the 2023 AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society, AIES 2023, held in Montreal, QC, Canada, 8-10 August 2023) [10.1145/3600211.3604655].
Files in this record:

File: how do you feel.pdf
Access: open access
Type: Publisher's version (publisher's layout)
License: Creative Commons
Size: 798.62 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/397739
Citations:
  • PMC: not available
  • Scopus: 0
  • Web of Science: 0
  • OpenAlex: not available