Scalable Similarity Matching in Streaming Time Series

IRIS

Nowadays online monitoring of data streams is essential in many real life applications, like sensor network monitoring, manufacturing process control, and video surveillance. One major problem in this area is the online identification of streaming sequences similar to a predefined set of pattern-sequences. In this paper, we present a novel solution that extends the state of the art both in terms of effectiveness and efficiency. We propose the first online similarity matching algorithm based on Longest Common SubSequence that is specifically designed to operate in a streaming context, and that can effectively handle time scaling, as well as noisy data. In order to deal with high stream rates and multiple streams, we extend the algorithm to operate on multilevel approximations of the streaming data, therefore quickly pruning the search space. Finally, we incorporate in our approach error estimation mechanisms in order to reduce the number of false negatives. We perform an extensive experimental evaluation using forty real datasets, diverse in nature and characteristics, and we also compare our approach to previous techniques. The experiments demonstrate the validity of our approach. The original publication is available in PAKDD 2012, Proceedings in Lecture Notes in Artificial Intelligence (LNAI), Springer Verlag (www.springerlink.com).

Scalable Similarity Matching in Streaming Time Series / Marascu, A., Ali Khan, S., Palpanas, T.. - ELETTRONICO. - (2011).

Scalable Similarity Matching in Streaming Time Series

Marascu, Alice^Primo;Ali Khan, Suleiman^Secondo;Palpanas, Themis^Ultimo

2011-01-01

Abstract

Nowadays online monitoring of data streams is essential in many real life applications, like sensor network monitoring, manufacturing process control, and video surveillance. One major problem in this area is the online identification of streaming sequences similar to a predefined set of pattern-sequences. In this paper, we present a novel solution that extends the state of the art both in terms of effectiveness and efficiency. We propose the first online similarity matching algorithm based on Longest Common SubSequence that is specifically designed to operate in a streaming context, and that can effectively handle time scaling, as well as noisy data. In order to deal with high stream rates and multiple streams, we extend the algorithm to operate on multilevel approximations of the streaming data, therefore quickly pruning the search space. Finally, we incorporate in our approach error estimation mechanisms in order to reduce the number of false negatives. We perform an extensive experimental evaluation using forty real datasets, diverse in nature and characteristics, and we also compare our approach to previous techniques. The experiments demonstrate the validity of our approach. The original publication is available in PAKDD 2012, Proceedings in Lecture Notes in Artificial Intelligence (LNAI), Springer Verlag (www.springerlink.com).

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2011
			
	Luogo di edizione (Place of publication)
	
				Trento
			
	Casa editrice (Publisher)
	
				Università degli Studi di Trento, Dipartimento di Ingegneria e Scienza dell'Informazione
			
	Citazione
	
				Scalable Similarity Matching in Streaming Time Series / Marascu, A., Ali Khan, S., Palpanas, T.. - ELETTRONICO. - (2011).
			
	Tutti gli autori
	
						Marascu, Alice; Ali Khan, Suleiman; Palpanas, Themis
					
	Appare nelle tipologie:
	
				07.2 Altre pubblicazioni (Other types of publications)

File in questo prodotto:

File	Dimensione	Formato
techRep484.pdf accesso aperto Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 743.81 kB Formato Adobe PDF Visualizza/Apri	743.81 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/359707

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

ND

ND

ND

social impact