In this thesis we investigate the automatic analysis of agreement and disagreement in political documents. Our focus is on the comparison of statements about specific topics extracted from documents with no direct interaction (e.g. electoral speeches or political manifestos), in which politicians may express, sometime in an implicit way, their position. This is a challenging task, made difficult also due to the lack of annotated resources. Our contribution can be divided into two main areas. The first one is the creation of manually and automatically annotated corpora for the task (pairs of statements annotated for agreement or disagreement from different sources). The second one is a Natural Language Processing (NLP) pipeline for the automatic (supervised) classification of agreement and disagreement. This pipeline involves a novel approach to extract well-defined and accurate topics based on key-concept clusters, and two classifiers to identify the pairs of statements in agreement and disagreement (or holding no relation) according to a wide set of features, such as sentiment, entailment, and semantic representation of the topics. We think that our findings can effectively support political science researchers dealing with an increasing amount of digital data, providing insight into similarities and differences in ideologies.
Automatic Analysis of Agreement and Disagreement in the Political Domain / Menini, Stefano. - (2018), pp. 1-121.
Automatic Analysis of Agreement and Disagreement in the Political Domain
Menini, Stefano
2018-01-01
Abstract
In this thesis we investigate the automatic analysis of agreement and disagreement in political documents. Our focus is on the comparison of statements about specific topics extracted from documents with no direct interaction (e.g. electoral speeches or political manifestos), in which politicians may express, sometime in an implicit way, their position. This is a challenging task, made difficult also due to the lack of annotated resources. Our contribution can be divided into two main areas. The first one is the creation of manually and automatically annotated corpora for the task (pairs of statements annotated for agreement or disagreement from different sources). The second one is a Natural Language Processing (NLP) pipeline for the automatic (supervised) classification of agreement and disagreement. This pipeline involves a novel approach to extract well-defined and accurate topics based on key-concept clusters, and two classifiers to identify the pairs of statements in agreement and disagreement (or holding no relation) according to a wide set of features, such as sentiment, entailment, and semantic representation of the topics. We think that our findings can effectively support political science researchers dealing with an increasing amount of digital data, providing insight into similarities and differences in ideologies.File | Dimensione | Formato | |
---|---|---|---|
Disclaimer.pdf
Solo gestori archivio
Tipologia:
Tesi di dottorato (Doctoral Thesis)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
88.46 kB
Formato
Adobe PDF
|
88.46 kB | Adobe PDF | Visualizza/Apri |
thesis_menini.pdf
Solo gestori archivio
Tipologia:
Tesi di dottorato (Doctoral Thesis)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
2.27 MB
Formato
Adobe PDF
|
2.27 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione