Names are studied in different fields, and, among the issues they present,name variations (e.g., translations, misspellings, etc...) and name variants (e.g., pseudonyms) pose a challenge to name matching, i.e., discovering instances that differ typographically but represent the same entity. Our scenario for name matching is a P2P, entity-based network of users divided in local level (the users), community level (groups of users), and global level(all the entities). Entities at local level are a partial view of the real word entity, represented at the global level. In this framework, name variations and name variants change the orthography of names because of linguistic and social factors, and their presence depends on the scenario level considered. Thus, they are hard to tackle by an automatic approach such as name matching. Our proposed solutions is to use a taxonomy we created to understand and predict the variations and variants of different entity names, and divide the entity name in different entries to accommodate the original name plus variations and variants. Our approach is novel because we take advantage of a multidisciplinary method, drawing from various fields (i.e., philosophy, sociology and geography) importing terms and views not found in computer science. We also draw from areas close to name matching, building from their findings and expanding them.

Semantic Name Matching / Bignotti, Enrico. - ELETTRONICO. - (2013), pp. 1-98.

Semantic Name Matching

Enrico, Bignotti
2013-01-01

Abstract

Names are studied in different fields, and, among the issues they present,name variations (e.g., translations, misspellings, etc...) and name variants (e.g., pseudonyms) pose a challenge to name matching, i.e., discovering instances that differ typographically but represent the same entity. Our scenario for name matching is a P2P, entity-based network of users divided in local level (the users), community level (groups of users), and global level(all the entities). Entities at local level are a partial view of the real word entity, represented at the global level. In this framework, name variations and name variants change the orthography of names because of linguistic and social factors, and their presence depends on the scenario level considered. Thus, they are hard to tackle by an automatic approach such as name matching. Our proposed solutions is to use a taxonomy we created to understand and predict the variations and variants of different entity names, and divide the entity name in different entries to accommodate the original name plus variations and variants. Our approach is novel because we take advantage of a multidisciplinary method, drawing from various fields (i.e., philosophy, sociology and geography) importing terms and views not found in computer science. We also draw from areas close to name matching, building from their findings and expanding them.
2013
Trento
Università degli Studi di Trento, Dipartimento di Ingegneria e Scienza dell'Informazione
Semantic Name Matching / Bignotti, Enrico. - ELETTRONICO. - (2013), pp. 1-98.
Bignotti, Enrico
File in questo prodotto:
File Dimensione Formato  
TP_tesi.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 793.32 kB
Formato Adobe PDF
793.32 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/359084
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact