This paper addresses the following predictive business process monitoring problem: Given the execution trace of an ongoing case,and given a set of traces of historical (completed) cases, predict the most likely outcome of the ongoing case. In this context, a trace refers to a sequence of events with corresponding payloads, where a payload consists of a set of attribute-value pairs. Meanwhile, an outcome refers to a label associated to completed cases, like, for example, a label indicating that a given case completed “on time” (with respect to a given desired duration) or “late”, or a label indicating that a given case led to a customer complaint or not. The paper tackles this problem via a two-phased approach. In the first phase, prefixes of historical cases are encoded using complex symbolic sequences and clustered. In the second phase, a classifier is built for each of the clusters. To predict the outcome of an ongoing case at runtime given its (uncompleted) trace, we select the closest cluster(s) to the trace in question and apply the respective classifier(s), taking into account the Euclidean distance of the trace from the center of the clusters. We consider two families of clustering algorithms – hierarchical clustering and k-medoids – and use random forests for classification. The approach was evaluated on four real-life datasets.

Complex Symbolic Sequence Clustering and Multiple Classifiers for Predictive Process Monitoring / Verenich, Ilya; Dumas, Marlon; La Rosa, Marcello; Maggi, Fabrizio Maria; Di Francescomarino, Chiara. - 256:(2016), pp. 218-229. (Intervento presentato al convegno 13th International Business Process Management Workshops (BPM 2015 Workshops) tenutosi a Vienna, Austria nel August 31 - September 3, 2015) [10.1007/978-3-319-42887-1_18].

Complex Symbolic Sequence Clustering and Multiple Classifiers for Predictive Process Monitoring

Di Francescomarino, Chiara
2016-01-01

Abstract

This paper addresses the following predictive business process monitoring problem: Given the execution trace of an ongoing case,and given a set of traces of historical (completed) cases, predict the most likely outcome of the ongoing case. In this context, a trace refers to a sequence of events with corresponding payloads, where a payload consists of a set of attribute-value pairs. Meanwhile, an outcome refers to a label associated to completed cases, like, for example, a label indicating that a given case completed “on time” (with respect to a given desired duration) or “late”, or a label indicating that a given case led to a customer complaint or not. The paper tackles this problem via a two-phased approach. In the first phase, prefixes of historical cases are encoded using complex symbolic sequences and clustered. In the second phase, a classifier is built for each of the clusters. To predict the outcome of an ongoing case at runtime given its (uncompleted) trace, we select the closest cluster(s) to the trace in question and apply the respective classifier(s), taking into account the Euclidean distance of the trace from the center of the clusters. We consider two families of clustering algorithms – hierarchical clustering and k-medoids – and use random forests for classification. The approach was evaluated on four real-life datasets.
2016
Proceedings of the 13th International Business Process Management Workshops (BPM 2015 Workshops)
Springer
978-3-319-42886-4
Verenich, Ilya; Dumas, Marlon; La Rosa, Marcello; Maggi, Fabrizio Maria; Di Francescomarino, Chiara
Complex Symbolic Sequence Clustering and Multiple Classifiers for Predictive Process Monitoring / Verenich, Ilya; Dumas, Marlon; La Rosa, Marcello; Maggi, Fabrizio Maria; Di Francescomarino, Chiara. - 256:(2016), pp. 218-229. (Intervento presentato al convegno 13th International Business Process Management Workshops (BPM 2015 Workshops) tenutosi a Vienna, Austria nel August 31 - September 3, 2015) [10.1007/978-3-319-42887-1_18].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/362712
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 34
  • ???jsp.display-item.citation.isi??? 24
social impact