In this thesis we investigate the automatic analysis of agreement and disagreement in political documents. Our focus is on the comparison of statements about specific topics extracted from documents with no direct interaction (e.g. electoral speeches or political manifestos), in which politicians may express, sometime in an implicit way, their position. This is a challenging task, made difficult also due to the lack of annotated resources. Our contribution can be divided into two main areas. The first one is the creation of manually and automatically annotated corpora for the task (pairs of statements annotated for agreement or disagreement from different sources). The second one is a Natural Language Processing (NLP) pipeline for the automatic (supervised) classification of agreement and disagreement. This pipeline involves a novel approach to extract well-defined and accurate topics based on key-concept clusters, and two classifiers to identify the pairs of statements in agreement and disagreement (or holding no relation) according to a wide set of features, such as sentiment, entailment, and semantic representation of the topics. We think that our findings can effectively support political science researchers dealing with an increasing amount of digital data, providing insight into similarities and differences in ideologies.

Automatic Analysis of Agreement and Disagreement in the Political Domain

Menini, Stefano
2018

Abstract

In this thesis we investigate the automatic analysis of agreement and disagreement in political documents. Our focus is on the comparison of statements about specific topics extracted from documents with no direct interaction (e.g. electoral speeches or political manifestos), in which politicians may express, sometime in an implicit way, their position. This is a challenging task, made difficult also due to the lack of annotated resources. Our contribution can be divided into two main areas. The first one is the creation of manually and automatically annotated corpora for the task (pairs of statements annotated for agreement or disagreement from different sources). The second one is a Natural Language Processing (NLP) pipeline for the automatic (supervised) classification of agreement and disagreement. This pipeline involves a novel approach to extract well-defined and accurate topics based on key-concept clusters, and two classifiers to identify the pairs of statements in agreement and disagreement (or holding no relation) according to a wide set of features, such as sentiment, entailment, and semantic representation of the topics. We think that our findings can effectively support political science researchers dealing with an increasing amount of digital data, providing insight into similarities and differences in ideologies.
2018
Inglese
Tonelli, Sara
Università degli studi di Trento
TRENTO
121
File in questo prodotto:
File Dimensione Formato  
Disclaimer.pdf

accesso solo da BNCF e BNCR

Dimensione 88.63 kB
Formato Adobe PDF
88.63 kB Adobe PDF
thesis_menini.pdf

accesso solo da BNCF e BNCR

Dimensione 2.27 MB
Formato Adobe PDF
2.27 MB Adobe PDF

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/178423
Il codice NBN di questa tesi è URN:NBN:IT:UNITN-178423