Robust Cross-Media Transfer for Visual Event Detection