In this paper we present a keyphrase extraction system called Keyphrase Digger (KD). The tool uses both statistical measures and linguistic information to detect a weighted list of n-grams representing the most important concepts of a text. KD is the reimplementation of an existing tool, which has been extended with new features, a high level of customizability, a shorter processing time and an extensive evaluation on different text genres in English and Italian (ie scientific articles and historical texts).

Digging in the Dirt: Extracting Keyphrases from Texts with KD

Sprugnoli, Rachele;
2015-01-01

Abstract

In this paper we present a keyphrase extraction system called Keyphrase Digger (KD). The tool uses both statistical measures and linguistic information to detect a weighted list of n-grams representing the most important concepts of a text. KD is the reimplementation of an existing tool, which has been extended with new features, a high level of customizability, a shorter processing time and an extensive evaluation on different text genres in English and Italian (ie scientific articles and historical texts).
2015
Proceedings of the Second Italian Conference on Computational Linguistics CLiC-it 2015
Torino
Accademia University Press srl
Moretti, Giovanni; Sprugnoli, Rachele; Tonelli, Sara
File in questo prodotto:
File Dimensione Formato  
Accademia_University_Press_978-88-99200-62-6.200-205.pdf

Solo gestori archivio

Descrizione: Paper
Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 341.11 kB
Formato Adobe PDF
341.11 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/164463
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact