In document search, documents are typically treated as a flat list of keywords. To deal with syntactic interoperability, i.e., the use of different keywords to refer to the same real-world entity, entity linkage has been used to replace keywords in the text with a unique identifier of the entity to which they refer. Yet a flat list of entities fails to capture the relationships that exist among the entities, information that is significant for more effective document search. In this work we propose to go one step beyond entity linkage in text and model documents as a set of structures that describe relationships among the entities mentioned in the text. We show that this kind of representation significantly improves the effectiveness of document search. We describe the implementation of this idea in detail and present an extensive set of experimental results that support our claim.

Entity-based keyword search in web documents / Sartori, Enrico; Velegrakis, Ioannis; Guerra, Francesco. - In: TRANSACTIONS ON COMPUTATIONAL COLLECTIVE INTELLIGENCE. - ISSN 2190-9288. - 9630:(2016), pp. 21-49. [10.1007/978-3-662-49521-6_2]

Entity-based keyword search in web documents

Sartori, Enrico; Velegrakis, Ioannis;
2016-01-01

Files in this item:
File: SartoriVG16.pdf
Access: Archive administrators only
Type: Refereed author’s manuscript (post-print)
License: All rights reserved
Size: 694.71 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/164702