Graph pattern mining aims at identifying structures that appear frequently in large graphs, under the assumption that frequency signifies importance. In real life, there are many graphs with weights on nodes and/or edges. For these graphs, it is fair that the importance (score) of a pattern is determined not only by the number of its appearances, but also by the weights on the nodes/edges of those appearances. Scoring functions based on the weights do not generally satisfy the apriori property, which guarantees that the number of appearances of a pattern cannot be larger than the frequency of any of its sub-patterns, and hence allows faster pruning. Therefore, existing approaches employ other, less efficient, pruning strategies. The problem becomes even more challenging in the case of multiple weighting functions that assign different weights to the same nodes/edges. In this work we propose a new family of scoring functions that respects the apriori property, and thus can rely on effec...
Mining patterns in graphs with multiple weights / Preti, Giulia; Lissandrini, Matteo; Mottin, Davide; Velegrakis, Yannis. - In: DISTRIBUTED AND PARALLEL DATABASES. - ISSN 0926-8782. - 39:2(2021), pp. 281-319. [10.1007/s10619-019-07259-w]
Mining patterns in graphs with multiple weights
Preti, Giulia;Lissandrini, Matteo;Mottin, Davide;Velegrakis, Yannis
2021-01-01
Abstract
Graph pattern mining aims at identifying structures that appear frequently in large graphs, under the assumption that frequency signifies importance. In real life, there are many graphs with weights on nodes and/or edges. For these graphs, it is fair that the importance (score) of a pattern is determined not only by the number of its appearances, but also by the weights on the nodes/edges of those appearances. Scoring functions based on the weights do not generally satisfy the apriori property, which guarantees that the number of appearances of a pattern cannot be larger than the frequency of any of its sub-patterns, and hence allows faster pruning. Therefore, existing approaches employ other, less efficient, pruning strategies. The problem becomes even more challenging in the case of multiple weighting functions that assign different weights to the same nodes/edges. In this work we propose a new family of scoring functions that respects the apriori property, and thus can rely on effec...I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione



