Counter Narrative Generation for Fighting Online Hate Speech

IRIS

Studies on online hate speech have mostly focused on the automated detection of harmful messages. Still, tackling hate speech through standard content moderation risks accusations of censorship and overblocking. One alternative strategy that has received little attention so far is to intervene directly in the discussion with counter narratives (i.e., textual responses meant to withstand hatred and prevent its spread). While recent advances in NLP can help automate counter narrative generation through pre-trained language models, challenges such as the lack of high-quality data, generic generation, hallucination and multilinguality must be addressed. This dissertation focuses on automatic and effective hate mitigation targeting Islamophobia through data collection and counter narrative generation. Firstly, we tackle the problem of data scarcity. We present CONAN, the first large-scale, expert-based hate countering dataset for English, Italian and French. We then present an author-reviewer approach that can automatically create high-quality data while reducing human effort. Secondly, we develop models to generate counter narratives, focusing on informative and multilingual responses. We introduce a knowledge-driven pipeline that can produce suitable and informative English counter narratives while avoiding hallucination. We address multilinguality by presenting approaches to counter narrative generation for Italian and by characterizing the effect of data size and data quality on model performance. Thirdly, we present an extensive evaluation of automatic counter narrative generation embedded in a platform that NGO operators can use to monitor social media data and counter hate speech. Results show increased efficiency and effectiveness of operators' activities. We conclude by discussing our contributions and future research directions for building hate countering models.

Counter Narrative Generation for Fighting Online Hate Speech / Chung, Yi-ling. - (2022 Apr 29), pp. 1-153. [10.15168/11572_338563]

Chung, Yi-ling

2022-04-29

Defended on: 29 Apr 2022
Cycle: XXXIV
Academic year: 2019-2020
Department: Ingegneria e scienza dell'Informaz (29/10/12-)
Doctoral programme: Information and Communication Technology
Unitn internal supervisor: Guerini, Marco
Bi-nationally supervised doctoral thesis: no
DOI: https://dx.doi.org/10.15168/11572_338563
Language: English
Appears in types: 08.1 Doctoral Thesis

Files in this item:

File	Size	Format
PhD_Thesis_YiLingChung.pdf (open access; description: Main article; type: Doctoral Thesis; license: Creative Commons)	2.36 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/338563
