Classifying imbalanced data sets using similarity based hierarchical decomposition