Supervised models for multimodal image retrieval based on visual, semantic and geographic information