EmoEvent: A multilingual emotion corpus based on different events

IRIS

In recent years emotion detection in text has become more popular due to its potential applications in fields such as psychology, marketing, political science, and artificial intelligence, among others. While opinion mining is a well-established task with many standard datasets and well-defined methodologies, emotion mining has received less attention due to its complexity. In particular, the annotated gold standard resources available are not enough. In order to address this shortage, we present a multilingual emotion dataset based on different events that took place in April 2019. We collected tweets from the Twitter platform. Then one of seven emotions, six Ekman's basic emotions plus the “neutral or other emotions”, was labeled on each tweet by 3 Amazon MTurkers. A total of 8,409 in Spanish and 7,303 in English were labeled. In addition, each tweet was also labeled as offensive or non-offensive. We report some linguistic statistics about the dataset in order to observe the difference between English and Spanish speakers when they express emotions related to the same events. Moreover, in order to validate the effectiveness of the dataset, we also propose a machine learning approach for automatically detecting emotions in tweets for both languages, English and Spanish.

EmoEvent: A multilingual emotion corpus based on different events / Plaza-Del-Arco, F.M., Strapparava, C., Alfonso Urena-Lopez, L., Teresa Martin-Valdivia, M.. - (2020), pp. 1492-1498. (12th International Conference on Language Resources and Evaluation, LREC 2020 Palais du Pharo, fra 2020).

EmoEvent: A multilingual emotion corpus based on different events

Plaza-Del-Arco F. M.;Strapparava C.;Alfonso Urena-Lopez L.;Teresa Martin-Valdivia M.

2020-01-01

Abstract

In recent years emotion detection in text has become more popular due to its potential applications in fields such as psychology, marketing, political science, and artificial intelligence, among others. While opinion mining is a well-established task with many standard datasets and well-defined methodologies, emotion mining has received less attention due to its complexity. In particular, the annotated gold standard resources available are not enough. In order to address this shortage, we present a multilingual emotion dataset based on different events that took place in April 2019. We collected tweets from the Twitter platform. Then one of seven emotions, six Ekman's basic emotions plus the “neutral or other emotions”, was labeled on each tweet by 3 Amazon MTurkers. A total of 8,409 in Spanish and 7,303 in English were labeled. In addition, each tweet was also labeled as offensive or non-offensive. We report some linguistic statistics about the dataset in order to observe the difference between English and Spanish speakers when they express emotions related to the same events. Moreover, in order to validate the effectiveness of the dataset, we also propose a machine learning approach for automatically detecting emotions in tweets for both languages, English and Spanish.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2020
			
	Titolo del volume (Proceedings title)
	
				LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
			
	Luogo di edizione (Place of publication)
	
				France
			
	Casa editrice (Publisher)
	
				European Language Resources Association (ELRA)
			
	ISBN
	
				979-10-95546-34-4
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85092222225
			
	Codice WOS (WOS identifier)
	
				WOS:000724697201070
			
	Tutti gli autori
	
						Plaza-Del-Arco, F. M.; Strapparava, C.; Alfonso Urena-Lopez, L.; Teresa Martin-Valdivia, M.
					
	Citazione
	
				EmoEvent: A multilingual emotion corpus based on different events / Plaza-Del-Arco, F.M., Strapparava, C., Alfonso Urena-Lopez, L., Teresa Martin-Valdivia, M.. - (2020), pp. 1492-1498. (12th International Conference on Language Resources and Evaluation, LREC 2020 Palais du Pharo, fra 2020).
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/341953

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

85

44

ND

social impact