Understanding metadata written in natural language is a crucial requirement for the successful automated integration of large-scale, language-rich classifications such as those used in digital libraries. In this article we analyze the natural language labels used in such classifications by exploring their syntactic structure, and we show how this structure can be used to detect language patterns that can be processed by a lightweight parser with an average accuracy of 96.82%. This allows for a deep understanding of the semantics of natural language metadata. In particular, we show that this improves the accuracy of the automatic translation of classifications into lightweight ontologies by almost 18% with respect to the previously used approach. This automatic translation is required by applications such as semantic matching, search, and classification algorithms.

Lightweight Parsing of Classifications / Autayeu, Aliaksandr; Andrews, Pierre; Giunchiglia, Fausto. - Electronic. - (2010), pp. 1-14.

Lightweight Parsing of Classifications

Autayeu, Aliaksandr; Andrews, Pierre; Giunchiglia, Fausto
2010-01-01

2010
Trento
University of Trento - Dipartimento di Ingegneria e Scienza dell'Informazione
Files in this item:
File: 068.pdf
Access: open access
Type: Publisher's layout
License: All rights reserved
Size: 304.07 kB
Format: Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/358285