The rise of process data availability has recently led to the development of data-driven learning approaches. However, most of these approaches restrict the use of the learned model to predict the future of ongoing process executions. The goal of this paper is moving a step forward and leveraging available data to learning to act, by supporting users with recommendations derived from an optimal strategy (measure of performance). We take the optimization perspective of one process actor and we recommend the best activities to execute next, in response to what happens in a complex external environment, where there is no control on exogenous factors. To this aim, we investigate an approach that learns, by means of Reinforcement Learning, the optimal policy from the observation of past executions and recommends the best activities to carry on for optimizing a Key Performance Indicator of interest. The validity of the approach is demonstrated on two scenarios taken from real-life data.
Learning to Act: A Reinforcement Learning Approach to Recommend the Best Next Activities / Branchi, Stefano; Di Francescomarino, Chiara; Ghidini, Chiara; Massimo, David; Ricci, Francesco; Ronzani, Massimiliano. - 458:(2022), pp. 137-154. (Intervento presentato al convegno Business Process Management Forum, BPM 2022 Forum tenutosi a Münster, Germany nel September 11–16, 2022) [10.1007/978-3-031-16171-1_9].
Learning to Act: A Reinforcement Learning Approach to Recommend the Best Next Activities
Di Francescomarino, Chiara;
2022-01-01
Abstract
The rise of process data availability has recently led to the development of data-driven learning approaches. However, most of these approaches restrict the use of the learned model to predict the future of ongoing process executions. The goal of this paper is moving a step forward and leveraging available data to learning to act, by supporting users with recommendations derived from an optimal strategy (measure of performance). We take the optimization perspective of one process actor and we recommend the best activities to execute next, in response to what happens in a complex external environment, where there is no control on exogenous factors. To this aim, we investigate an approach that learns, by means of Reinforcement Learning, the optimal policy from the observation of past executions and recommends the best activities to carry on for optimizing a Key Performance Indicator of interest. The validity of the approach is demonstrated on two scenarios taken from real-life data.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione