Exploiting Contextualized Word Representations to Profile Haters on Twitter

Ceron, Tanise; Casula, Camilla

In this paper, we present our submission to the Profiling Haters on Twitter shared task at PAN@CLEF2021. The task aims at analyzing Twitter feeds of users in two languages, English and Spanish, in order to determine whether these users spread hate speech on social media. For English, we propose an approach which exploits contextualized word embeddings and a statistical feature extraction method, in order to find words which are used in different contexts by haters and non-haters, and we use these words as features to train a classifier. For Spanish, on the other hand, we take advantage of BERT sequence representations, using the average of the sequence representations of all tweets from a user as a feature to train a model for classifying users into haters and non-haters.
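
As a rough illustration of the Spanish-language approach described in the abstract, the sketch below averages per-tweet vectors into a single user-level feature vector. It assumes the tweet vectors have already been computed, for example by a BERT encoder. The function name `average_user_embedding` and the toy 3-dimensional vectors are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def average_user_embedding(tweet_vectors: Sequence[Sequence[float]]) -> List[float]:
    """Mean-pool per-tweet vectors into one user-level feature vector."""
    if not tweet_vectors:
        raise ValueError("a user needs at least one tweet vector")
    dim = len(tweet_vectors[0])
    if any(len(v) != dim for v in tweet_vectors):
        raise ValueError("all tweet vectors must have the same dimension")
    n = len(tweet_vectors)
    # Average each dimension across all of the user's tweets.
    return [sum(v[i] for v in tweet_vectors) / n for i in range(dim)]


# Toy 3-dimensional vectors standing in for real sequence representations
# (an assumption for illustration; real BERT vectors are much larger).
toy_user = [[1.0, 2.0, 3.0], [3.0, 4.0, 5.0]]
user_features = average_user_embedding(toy_user)
```

The resulting `user_features` vector would then serve as the input to whichever classifier separates haters from non-haters; the choice of classifier is not specified in this record.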

Exploiting Contextualized Word Representations to Profile Haters on Twitter / Ceron, T., Casula, C. - 2936:160 (2021), pp. 1871-1882. (22nd Working Notes of CLEF - Conference and Labs of the Evaluation Forum, CLEF-WN 2021 Online, Bucharest, 21st Sept - 24th Sept 2021).

2021-01-01

Date of publication: 2021
Proceedings title: CLEF 2021 Working Notes. Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum
Place of publication: Aachen
Publisher: CEUR-WS
Scopus Identifier: 2-s2.0-85113512826
All authors: Ceron, Tanise; Casula, Camilla
Appears in types: 04.1 Paper in Proceedings

Files in this record:

File	Size	Format
ceron_2021.pdf (open access; type: publisher's layout; license: Creative Commons)	952.04 kB	Adobe PDF