In order to generate semantic annotations for a collection of documents, one needs an annotation schema consisting of a semantic model (a.k.a. ontology) along with lists of linguistic indicators (keywords and patterns) for each concept in the ontology. The focus of this paper is the automatic generation of the linguistic indicators for a given semantic model and a corpus of documents. Our approach needs a small number of user-defined seeds and bootstraps itself by exploiting a novel clustering technique. The baseline for this work is the Cerno project [8] and the clustering algorithm LIMBO [2]. We also present results that compare the output of the clustering algorithm with linguistic indicators created manually for two case studies.

Automating the Generation of Semantic Annotation Tools Using a Clustering Technique

Souza, Vitor;Zeni, Nicola;Kiyavitskaya, Nadzeya;Andritsos, Periklis;Mich, Luisa;Mylopoulos, John
2008-01-01

Abstract

In order to generate semantic annotations for a collection of documents, one needs an annotation schema consisting of a semantic model (a.k.a. ontology) along with lists of linguistic indicators (keywords and patterns) for each concept in the ontology. The focus of this paper is the automatic generation of the linguistic indicators for a given semantic model and a corpus of documents. Our approach needs a small number of user-defined seeds and bootstraps itself by exploiting a novel clustering technique. The baseline for this work is the Cerno project [8] and the clustering algorithm LIMBO [2]. We also present results that compare the output of the clustering algorithm with linguistic indicators created manually for two case studies.
2008
13th International Conference on Applications of Natural Language to Information Systems Proceedings
Berlin; Heidelberg
Springer
3540698574
9783540698579
Souza, Vitor; Zeni, Nicola; Kiyavitskaya, Nadzeya; Andritsos, Periklis; Mich, Luisa; Mylopoulos, John
File in questo prodotto:
File Dimensione Formato  
Automating the Generation of Semantic Annotation Tools Using Clustering Techniques.pdf

Open Access dal 01/01/2010

Tipologia: Post-print referato (Refereed author’s manuscript)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 215.62 kB
Formato Adobe PDF
215.62 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/31016
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 2
social impact