Automatic detection of irony is one of the hot topics for sentiment analysis, as it changes the polarity of text. Most of the work has been focused on the detection of figurative language in Twitter data due to relative ease of obtaining annotated data, thanks to the use of hashtags to signal irony. However, irony is present generally in natural language conversations and in particular in online public fora. In this paper, we present a comparative evaluation of irony detection from Italian news fora and Twitter posts. Since irony is not a very frequent phenomenon, its automatic detection suffers from data imbalance and feature sparseness problems. We experiment with different representations of text – bag-of-words, writing style, and word embeddings to address the feature sparseness; and balancing techniques to address the data imbalance.
Irony detection: From the twittersphere to the News Space / Cervone, Alessandra; Stepanov, Evgeny A.; Celli, Fabio; Riccardi, Giuseppe. - 2006:(2017). (Intervento presentato al convegno 4th Italian Conference on Computational Linguistics, CLiC-it 2017 tenutosi a Roma nel 11-13 december 2017).
Irony detection: From the twittersphere to the News Space
Cervone, Alessandra;Stepanov, Evgeny A.;Celli, Fabio;Riccardi, Giuseppe
2017-01-01
Abstract
Automatic detection of irony is one of the hot topics for sentiment analysis, as it changes the polarity of text. Most of the work has been focused on the detection of figurative language in Twitter data due to relative ease of obtaining annotated data, thanks to the use of hashtags to signal irony. However, irony is present generally in natural language conversations and in particular in online public fora. In this paper, we present a comparative evaluation of irony detection from Italian news fora and Twitter posts. Since irony is not a very frequent phenomenon, its automatic detection suffers from data imbalance and feature sparseness problems. We experiment with different representations of text – bag-of-words, writing style, and word embeddings to address the feature sparseness; and balancing techniques to address the data imbalance.File | Dimensione | Formato | |
---|---|---|---|
paper046.pdf
accesso aperto
Tipologia:
Post-print referato (Refereed author’s manuscript)
Licenza:
Altra licenza (Other type of license)
Dimensione
126.96 kB
Formato
Adobe PDF
|
126.96 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione