Lightweight Parsing of Classifications into Lightweight Ontologies / Autayeu, Aliaksandr; Andrews, Pierre; Giunchiglia, Fausto. - ELECTRONIC. - (2010), pp. 1-12.
Lightweight Parsing of Classifications into Lightweight Ontologies
Autayeu, Aliaksandr; Andrews, Pierre; Giunchiglia, Fausto
2010-01-01
Abstract
Understanding metadata written in natural language is a prerequisite for the successful automated integration of large-scale, language-rich classifications such as those used in digital libraries. We analyze the natural language labels within classifications by exploring their syntactic structure. We then show how this structure can be used to detect language patterns that a lightweight parser can process with an average accuracy of 96.82%. This allows for a deeper understanding of natural language metadata semantics, which we show can improve by almost 18% the accuracy of the automatic translation of classifications into the lightweight ontologies required by semantic matching, search, and classification algorithms.

File | Size | Format
---|---|---
025.pdf (open access; type: publisher's layout; license: all rights reserved) | 290 kB | Adobe PDF