A Ground-Truth Training Set for Hierarchical Clustering in Content-Based Image Retrieval