Understanding metadata written in natural language is a prerequisite for successful automated integration of large-scale, language-rich datasets such as digital libraries. In this paper we analyze the part-of-speech structure of two different metadata datasets and show how this structure can be used to detect structural patterns that can be parsed by lightweight grammars with an accuracy ranging from 95.3% to 99.8%. This allows a deeper understanding of metadata semantics, which is important for tasks such as translating classifications into lightweight ontologies for use in semantic matching.
Lightweight Parsing of Natural Language Metadata / Autayeu, Aliaksandr; Andrews, Pierre; Ju, Qi; Giunchiglia, Fausto. - Electronic. - (2009), pp. 1-5.
Lightweight Parsing of Natural Language Metadata
Autayeu, Aliaksandr; Andrews, Pierre; Ju, Qi; Giunchiglia, Fausto
2009-01-01
File | Access | Type | License | Size | Format
---|---|---|---|---|---
028.pdf | Open access | Publisher's layout | All rights reserved | 94.07 kB | Adobe PDF