Detecting depression or personality traits, tutoring and student behaviour systems, or identifying cases of cyber-bulling are a few of the wide range of the applications, in which the automatic detection of emotion is crucial. This task can contribute to the benefit of business, society, politics or education. The main objective of our research is focused on the improvement of the supervised emotion detection systems developed so far, through the definition and implementation of a technique to annotate large scale English emotional corpora automatically and with high standards of reliability. Our proposal is based on a bootstrapping process made up two main steps: the creation of the seed using NRC Emotion Lexicon and its extension employing the distributional semantic similarity through words embeddings. The results obtained are promising and allow us to confirm the soundness of the bootstrapping technique combined with the word embedding to label emotional corpora automatically.

Bootstrapping technique + embeddings = emotional corpus annotated automatically / Canales, Lea; Strapparava, Carlo; Boldrini, Ester; Martinez-Barco, Patricio. - 10341:(2017), pp. 110-121. (Intervento presentato al convegno Future and Emerging Trends in Language Technologies, Machine Learning and Big Data (FETLT-2016) tenutosi a Sevilla, Spain nel December) [10.1007/978-3-319-69365-1_9].

Bootstrapping technique + embeddings = emotional corpus annotated automatically

Carlo Strapparava;
2017-01-01

Abstract

Detecting depression or personality traits, tutoring and student behaviour systems, or identifying cases of cyber-bulling are a few of the wide range of the applications, in which the automatic detection of emotion is crucial. This task can contribute to the benefit of business, society, politics or education. The main objective of our research is focused on the improvement of the supervised emotion detection systems developed so far, through the definition and implementation of a technique to annotate large scale English emotional corpora automatically and with high standards of reliability. Our proposal is based on a bootstrapping process made up two main steps: the creation of the seed using NRC Emotion Lexicon and its extension employing the distributional semantic similarity through words embeddings. The results obtained are promising and allow us to confirm the soundness of the bootstrapping technique combined with the word embedding to label emotional corpora automatically.
2017
Proceedings of Future and Emerging Trends in Language Technologies, Machine Learning and Big Data (FETLT-2016)
Switzerland
Springer
978-3-319-69364-4
Canales, Lea; Strapparava, Carlo; Boldrini, Ester; Martinez-Barco, Patricio
Bootstrapping technique + embeddings = emotional corpus annotated automatically / Canales, Lea; Strapparava, Carlo; Boldrini, Ester; Martinez-Barco, Patricio. - 10341:(2017), pp. 110-121. (Intervento presentato al convegno Future and Emerging Trends in Language Technologies, Machine Learning and Big Data (FETLT-2016) tenutosi a Sevilla, Spain nel December) [10.1007/978-3-319-69365-1_9].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/344060
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact