The problem of detecting frequent items in streaming data is relevant to many different applications across many domains. Several algorithms, diverse in nature, have been proposed in the literature for the solution of the above problem. In this paper, we review these algorithms, and we present the results of the first extensive comparative experimental study of the most prominent algorithms in the literature. The algorithms were comprehensively tested using a common test framework on a variety of real and synthetic data. Their performance with respect to the different parameters (i.e., parameters intrinsic to the algorithms, and data related parameters) was studied. We report the results, and insights gained through these experiments. This work has been published in the Data and Knowledge Engineering (DKE) journal. Please reference it as follows: Nishad Manerikar, Themis Palpanas. Frequent Items in Streaming Data: An Experimental Evaluation of the State-of-the-Art. Data and Knowledge Engineering (DKE) 68(4), 2009: 415-430

Frequent Items in Streaming Data: An Experimental Evaluation of the State-of-the-Art / Manerikar, Nishad; Palpanas, Themis. - ELETTRONICO. - (2008), pp. 1-27.

Frequent Items in Streaming Data: An Experimental Evaluation of the State-of-the-Art

Palpanas, Themis
2008-01-01

Abstract

The problem of detecting frequent items in streaming data is relevant to many different applications across many domains. Several algorithms, diverse in nature, have been proposed in the literature for the solution of the above problem. In this paper, we review these algorithms, and we present the results of the first extensive comparative experimental study of the most prominent algorithms in the literature. The algorithms were comprehensively tested using a common test framework on a variety of real and synthetic data. Their performance with respect to the different parameters (i.e., parameters intrinsic to the algorithms, and data related parameters) was studied. We report the results, and insights gained through these experiments. This work has been published in the Data and Knowledge Engineering (DKE) journal. Please reference it as follows: Nishad Manerikar, Themis Palpanas. Frequent Items in Streaming Data: An Experimental Evaluation of the State-of-the-Art. Data and Knowledge Engineering (DKE) 68(4), 2009: 415-430
2008
Trento
Università degli Studi di Trento
Frequent Items in Streaming Data: An Experimental Evaluation of the State-of-the-Art / Manerikar, Nishad; Palpanas, Themis. - ELETTRONICO. - (2008), pp. 1-27.
Manerikar, Nishad; Palpanas, Themis
File in questo prodotto:
File Dimensione Formato  
017.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 241.48 kB
Formato Adobe PDF
241.48 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/357845
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact