This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.
The content types dataset: a new resource to explore semantic and functional characteristics of texts / Sprugnoli, Rachele; Caselli, Tommaso; Tonelli, Sara; Moretti, Giovanni. - Electronic. - (2017), pp. 260-266. (Paper presented at the conference EACL 2017 - European Chapter of the Association for Computational Linguistics 2017, held in Valencia, Spain, on 3-7 April 2017).
The content types dataset: a new resource to explore semantic and functional characteristics of texts
Sprugnoli, Rachele;
2017-01-01
File | Access | Type | License | Size | Format |
---|---|---|---|---|---|
E17-2042.pdf | Archive managers only | Publisher's version (publisher's layout) | Other type of license | 115.74 kB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.