In this paper, we present our submission to the Profiling Haters on Twitter shared task at PAN@CLEF2021. The task aims at analyzing Twitter feeds of users in two languages, English and Spanish, in order to determine whether these users spread hate speech on social media. For English, we propose an approach which exploits contextualized word embeddings and a statistical feature extraction method, in order to find words which are used in different contexts by haters and non-haters, and we use these words as features to train a classifier. For Spanish, on the other hand, we take advantage of BERT sequence representations, using the average of the sequence representations of all tweets from a user as a feature to train a model for classifying users into haters and non-haters.

Exploiting contextualized word representations to profile haters on Twitter / Ceron, T.; Casula, C.. - 2936:(2021), pp. 1871-1882. (Intervento presentato al convegno 2021 Working Notes of CLEF - Conference and Labs of the Evaluation Forum, CLEF-WN 2021 tenutosi a Online, Bucharest nel 21st Sept- 24th Sept 2021).

Exploiting contextualized word representations to profile haters on Twitter

Casula C.
2021-01-01

Abstract

In this paper, we present our submission to the Profiling Haters on Twitter shared task at PAN@CLEF2021. The task aims at analyzing Twitter feeds of users in two languages, English and Spanish, in order to determine whether these users spread hate speech on social media. For English, we propose an approach which exploits contextualized word embeddings and a statistical feature extraction method, in order to find words which are used in different contexts by haters and non-haters, and we use these words as features to train a classifier. For Spanish, on the other hand, we take advantage of BERT sequence representations, using the average of the sequence representations of all tweets from a user as a feature to train a model for classifying users into haters and non-haters.
2021
CEUR Workshop Proceedings
Aachen
CEUR-WS
Ceron, T.; Casula, C.
Exploiting contextualized word representations to profile haters on Twitter / Ceron, T.; Casula, C.. - 2936:(2021), pp. 1871-1882. (Intervento presentato al convegno 2021 Working Notes of CLEF - Conference and Labs of the Evaluation Forum, CLEF-WN 2021 tenutosi a Online, Bucharest nel 21st Sept- 24th Sept 2021).
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/330503
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact