GLocal Structural Feature Selection with Sparsity for Multimedia Data Understanding