From Web Directories to Ontologies: Natural Language Processing Challenges

IRIS

Hierarchical classifications are used pervasively by humans as a means to organize their data and knowledge about the world. One of their main advantages is that natural language labels, used to describe their contents, are easily understood by human users. However, at the same time, this is also one of their main disadvantages as these same labels are ambiguous and very hard to be reasoned about by software agents. This fact creates an insuperable hindrance for classifications to being embedded in the Semantic Web infrastructure. This paper presents an approach to converting classifications into lightweight ontologies, and it makes the following contributions: (i) it identifies the main NLP problems related to the conversion process and shows how they are different from the classical problems of NLP; (ii) it proposes heuristic solutions to these problems, which are especially effective in this domain; and (iii) it evaluates the proposed solutions by testing them on DMoz data.

From Web Directories to Ontologies: Natural Language Processing Challenges

I. Zaihrayeu;L. Su;Giunchiglia, Fausto;P. Wei;J. Qi;C. Mingmin;H. Xuanjing

2007-01-01

Abstract

Hierarchical classifications are used pervasively by humans as a means to organize their data and knowledge about the world. One of their main advantages is that natural language labels, used to describe their contents, are easily understood by human users. However, at the same time, this is also one of their main disadvantages as these same labels are ambiguous and very hard to be reasoned about by software agents. This fact creates an insuperable hindrance for classifications to being embedded in the Semantic Web infrastructure. This paper presents an approach to converting classifications into lightweight ontologies, and it makes the following contributions: (i) it identifies the main NLP problems related to the conversion process and shows how they are different from the classical problems of NLP; (ii) it proposes heuristic solutions to these problems, which are especially effective in this domain; and (iii) it evaluates the proposed solutions by testing them on DMoz data.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2007
			
	Titolo del volume (Proceedings title)
	
				Proceedings of the 6th International Semantic Web Conference and the 2nd Asian Semantic Web Conference, (ISWC'07/ + ASWC'07)
			
	Luogo di edizione (Place of publication)
	
				Berlin, Heidelberg
			
	Casa editrice (Publisher)
	
				SPRINGER-VERLAG BERLIN
			
	ISBN
	
				9783540762973
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-49949083688
			
	Codice WOS (WOS identifier)
	
				WOS:000251080500045
			
	Tutti gli autori
	
						I., Zaihrayeu; L., Su; Giunchiglia, Fausto; P., Wei; J., Qi; C., Mingmin; H., Xuanjing
					
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/14251

Citazioni

ND

26

15

71

social impact