In order to generate semantic annotations for a collection of documents, one needs an annotation schema consisting of a semantic model (a.k.a. ontology) along with lists of linguistic indicators (keywords and patterns) for each concept in the ontology. The focus of this paper is the automatic generation of the linguistic indicators for a given semantic model and a corpus of documents. Our approach needs a small number of user-defined seeds and bootstraps itself by exploiting a novel clustering technique. The baseline for this work is the Cerno project [8] and the clustering algorithm LIMBO [2]. We also present results that compare the output of the clustering algorithm with linguistic indicators created manually for two case studies. © 2008 Springer-Verlag Berlin Heidelberg.
Titolo: | Automating the Generation of Semantic Annotation Tools Using a Clustering Technique | |
Autori: | V., Souza; Zeni, Nicola; Kiyavitskaya, Nadzeya; P., Andritsos; Mich, Luisa; Mylopoulos, Ioannis | |
Autori Unitn: | ||
Titolo del volume contenente il saggio: | NLDB08 | |
Luogo di edizione: | Berlin Heidelberg | |
Casa editrice: | Springer | |
Anno di pubblicazione: | 2008 | |
Codice identificativo Scopus: | 2-s2.0-47749107271 | |
Codice identificativo WOS: | WOS:000257180400009 | |
ISBN: | 3540698574 9783540698579 | |
Handle: | http://hdl.handle.net/11572/31016 | |
Appare nelle tipologie: | 04.1 Saggio in atti di convegno (Paper in Proceedings) |