The ability of identifying whether two strings represent names referring to the same real world entity is essential for avoiding information integration problems, such as duplication of records. We study this problem in a scenario where the amount of data to analyze becomes large. Our purpose is to develop a framework that address the name match and search problem, combining together different strategies, and is able to consider also the semantic of the string representing a name. Moreover we propose a dataset for evaluating name matching algorithm which consider semantic variation of names.
A Large Scale Name Matching and Search Framework / Margonar, Stella. - ELETTRONICO. - (2013), pp. 1-106.
A Large Scale Name Matching and Search Framework
Margonar, Stella
2013-01-01
Abstract
The ability of identifying whether two strings represent names referring to the same real world entity is essential for avoiding information integration problems, such as duplication of records. We study this problem in a scenario where the amount of data to analyze becomes large. Our purpose is to develop a framework that address the name match and search problem, combining together different strategies, and is able to consider also the semantic of the string representing a name. Moreover we propose a dataset for evaluating name matching algorithm which consider semantic variation of names.File | Dimensione | Formato | |
---|---|---|---|
DISI-13-026.pdf
accesso aperto
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
2.67 MB
Formato
Adobe PDF
|
2.67 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione