Efficiently Discovering Recent Frequent Items In Data Streams

Tantono, F. I.; Manerikar, N.; Palpanas, Themistoklis

doi:10.1007/978-3-540-69497-7_16

The problem of frequent item discovery in streaming data has attracted a lot of attention lately. While the above problem has been studied extensively, and several techniques have been proposed for its solution, these approaches treat all the values of the data stream equally. Nevertheless, not all values are of equal importance. In several situations, we are interested more in the new values that have appeared in the stream, rather than in the older ones. In this paper, we address the problem of finding <em>recent</em>frequent items in a data stream given a small bounded memory, and present novel algorithms to this direction. We propose a basic algorithm that extends the functionality of existing approaches by monitoring item frequencies in recent windows. Subsequently, we present an improved version of the algorithm with significantly improved performance (in terms of accuracy), at no extra memory cost. Finally, we perform an extensive experimental evaluation, and show that the proposed algorithms can efficiently identify the frequent items in ad hoc recent windows of a data stream.

Efficiently Discovering Recent Frequent Items In Data Streams

F. I. Tantono;N. Manerikar;Palpanas, Themistoklis

2008-01-01

Abstract

The problem of frequent item discovery in streaming data has attracted a lot of attention lately. While the above problem has been studied extensively, and several techniques have been proposed for its solution, these approaches treat all the values of the data stream equally. Nevertheless, not all values are of equal importance. In several situations, we are interested more in the new values that have appeared in the stream, rather than in the older ones. In this paper, we address the problem of finding recentfrequent items in a data stream given a small bounded memory, and present novel algorithms to this direction. We propose a basic algorithm that extends the functionality of existing approaches by monitoring item frequencies in recent windows. Subsequently, we present an improved version of the algorithm with significantly improved performance (in terms of accuracy), at no extra memory cost. Finally, we perform an extensive experimental evaluation, and show that the proposed algorithms can efficiently identify the frequent items in ad hoc recent windows of a data stream.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2008
			
	Titolo del volume (Proceedings title)
	
				International Conference on Scientific and Statistical DataBase Management (SSDBM)
			
	Luogo di edizione (Place of publication)
	
				Berlin
			
	Casa editrice (Publisher)
	
				Springer
			
	ISBN
	
				9783540694762
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-49049103988
			
	Tutti gli autori
	
						F. I., Tantono; N., Manerikar; Palpanas, Themistoklis
					
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/75049

Efficiently Discovering Recent Frequent Items In Data Streams

F. I. Tantono;N. Manerikar;Palpanas, Themistoklis

2008-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

Efficiently Discovering Recent Frequent Items In Data Streams

F. I. Tantono;N. Manerikar;Palpanas, Themistoklis

2008-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)