INCREASINGLY SPECIALIZED ENSEMBLE OF CONVOLUTIONAL NEURAL NETWORKS FOR FINE-GRAINED RECOGNITION