A Large Scale Dataset for the Evaluation of Ontology Matching Systems

The number of ontology matching techniques and systems has increased significantly in recent years, which makes their evaluation and comparison a more pressing issue. One of the challenges of ontology matching evaluation is building large-scale evaluation datasets. The number of possible correspondences between two ontologies grows quadratically with the number of entities in those ontologies, which often makes manual construction of evaluation datasets demanding to the point of being infeasible for large-scale matching tasks. In this paper we present TaxME2, an ontology matching evaluation dataset composed of thousands of matching tasks. It was built semi-automatically from the Google, Yahoo and Looksmart web directories. We evaluated TaxME2 using the results of almost two dozen state-of-the-art ontology matching systems. The experiments indicate that the dataset possesses the desired key properties: it is error-free, incremental, discriminative, monotonic, and hard for state-of-the-art ontology matching systems.

A Large Scale Dataset for the Evaluation of Ontology Matching Systems / Giunchiglia, F., Yatskevich, M., Avesani, P., Shvaiko, P. - Electronic. - (2008), pp. 1-24.

Giunchiglia, Fausto; Yatskevich, Mikalai; Avesani, Paolo; Shvaiko, Pavel

2008-01-01

Date of publication: 2008
Place of publication: Trento
Publisher: Università degli Studi di Trento, Dipartimento di Ingegneria e Scienza dell'Informazione
Publication type: 07.2 Other types of publications

Files in this item:

File	Access	Type	License	Size	Format
001.pdf	Open access	Publisher's layout	All rights reserved	4.09 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/359604

Warning: the data displayed have not been validated by the university.
