Navigating the Topical Structure of Academic Search Results via the Wikipedia Category Network

IRIS

Searching for scientific publications on the Web is a tedious task, especially when exploring an unfamiliar domain. Typical scholarly search engines produce lengthy unstructured result lists that are difficult to comprehend, interpret and browse. We propose a novel method of organizing the search results into concise and informative topic hierarchies. The method consists of two steps: extracting interrelated topics from the result set, and summarizing the topic graph. In the first step we map the search results to articles and categories of Wikipedia, constructing a graph of relevant topics with hierarchical relations. In the second step we sequentially build nested summaries of the produced topic graph using a structured output prediction approach. Trained on a small number of examples, our method learns to construct informative summaries for unseen topic graphs, and outperforms unsupervised state-of-the-art Wikipedia-based clustering. Copyright is held by the owner/author(s).

Navigating the Topical Structure of Academic Search Results via the Wikipedia Category Network

D. Mirylenka;Passerini, Andrea

2013-01-01

Abstract

Searching for scientific publications on the Web is a tedious task, especially when exploring an unfamiliar domain. Typical scholarly search engines produce lengthy unstructured result lists that are difficult to comprehend, interpret and browse. We propose a novel method of organizing the search results into concise and informative topic hierarchies. The method consists of two steps: extracting interrelated topics from the result set, and summarizing the topic graph. In the first step we map the search results to articles and categories of Wikipedia, constructing a graph of relevant topics with hierarchical relations. In the second step we sequentially build nested summaries of the produced topic graph using a structured output prediction approach. Trained on a small number of examples, our method learns to construct informative summaries for unseen topic graphs, and outperforms unsupervised state-of-the-art Wikipedia-based clustering. Copyright is held by the owner/author(s).

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2013
			
	Titolo del volume (Proceedings title)
	
				Proceedings of the 22Nd ACM International Conference on Information Knowledge Management
			
	Luogo di edizione (Place of publication)
	
				New York, NY, USA
			
	Casa editrice (Publisher)
	
				ACM
			
	ISBN
	
				9781450322638
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-84889566163
			
	Codice WOS (WOS identifier)
	
				WOS:000722225900104
			
	Tutti gli autori
	
						D., Mirylenka; Passerini, Andrea
					
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/67305

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

11

6

9

social impact