We propose a novel approach to the problem of semantic heterogeneity where data are organized into a set of stratified and independent representation layers, namely: conceptual (where a set of unique alinguistic identifiers are connected inside a graph codifying their meaning), language (where sets of synonyms, possibly from multiple languages, annotate concepts), knowledge (in the form of a graph where nodes are entity types and links are properties), and data (in the form of a graph of entities populating the previous knowledge graph). This allows us to state the problem of semantic heterogeneity as a problem of Representation Diversity where the different types of heterogeneity, viz. Conceptual, Language, Knowledge, and Data, are uniformly dealt within each single layer, independently from the others. In this paper we describe the proposed stratified representation of data and the process by which data are first transformed into the target representation, then suitably integrated and then, finally, presented to the user in her preferred format. The proposed framework has been evaluated in various pilot case studies and in a number of industrial data integration problems.

Stratified Data Integration / Giunchiglia, F.; Zamboni, A.; Bagchi, M.; Bocca, S.. - 2873:(2021), pp. 1-15. (Intervento presentato al convegno 2nd International Workshop on Knowledge Graph Construction (KGCW), Co-located with ESWC 2021 tenutosi a Online nel 2021).

Stratified Data Integration

Giunchiglia F.;Zamboni A.;Bagchi M.;Bocca S.
2021-01-01

Abstract

We propose a novel approach to the problem of semantic heterogeneity where data are organized into a set of stratified and independent representation layers, namely: conceptual (where a set of unique alinguistic identifiers are connected inside a graph codifying their meaning), language (where sets of synonyms, possibly from multiple languages, annotate concepts), knowledge (in the form of a graph where nodes are entity types and links are properties), and data (in the form of a graph of entities populating the previous knowledge graph). This allows us to state the problem of semantic heterogeneity as a problem of Representation Diversity where the different types of heterogeneity, viz. Conceptual, Language, Knowledge, and Data, are uniformly dealt within each single layer, independently from the others. In this paper we describe the proposed stratified representation of data and the process by which data are first transformed into the target representation, then suitably integrated and then, finally, presented to the user in her preferred format. The proposed framework has been evaluated in various pilot case studies and in a number of industrial data integration problems.
2021
CEUR Workshop Proceedings
Online
CEUR-WS
Giunchiglia, F.; Zamboni, A.; Bagchi, M.; Bocca, S.
Stratified Data Integration / Giunchiglia, F.; Zamboni, A.; Bagchi, M.; Bocca, S.. - 2873:(2021), pp. 1-15. (Intervento presentato al convegno 2nd International Workshop on Knowledge Graph Construction (KGCW), Co-located with ESWC 2021 tenutosi a Online nel 2021).
File in questo prodotto:
File Dimensione Formato  
2021 ESWC KGCW - SDIntegration.pdf

accesso aperto

Tipologia: Post-print referato (Refereed author’s manuscript)
Licenza: Creative commons
Dimensione 4.39 MB
Formato Adobe PDF
4.39 MB Adobe PDF Visualizza/Apri
paper6.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 4.59 MB
Formato Adobe PDF
4.59 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/319644
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact