Data preprocessing in SVM-based keywords extraction from scientific documents