Scientific documents are unstructured data consisting of natural language and hard for scientists to read and manage. Keywords are very helpful for scientists to search the related documents and know about their contents in a prompt way. In this paper we investigate a kind of data preprocessing technique used in SVM-based keyword extraction from scientific documents. Four definitions of regular scientific documents are proposed, and the analysis on the experimental results is performed based on the proposed definitions. The experimental results confirm the intuition that abstract is important for keywords extraction. © 2009 IEEE.

Data preprocessing in SVM-based keywords extraction from scientific documents / Wu, Chunguo; Marchese, Maurizio; Wang, Yufei; Krapivin, Mikalai; Wang, Chaoyong; Li, Xitong; Liang, Yanchun. - (2009), pp. 810-813. (Intervento presentato al convegno 2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009 tenutosi a Kaohsiung, Taiwan nel 2009) [10.1109/ICICIC.2009.155].

Data preprocessing in SVM-based keywords extraction from scientific documents

Marchese, Maurizio;Krapivin, Mikalai;Liang, Yanchun
2009-01-01

Abstract

Scientific documents are unstructured data consisting of natural language and hard for scientists to read and manage. Keywords are very helpful for scientists to search the related documents and know about their contents in a prompt way. In this paper we investigate a kind of data preprocessing technique used in SVM-based keyword extraction from scientific documents. Four definitions of regular scientific documents are proposed, and the analysis on the experimental results is performed based on the proposed definitions. The experimental results confirm the intuition that abstract is important for keywords extraction. © 2009 IEEE.
2009
2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009
NEW YORK
IEEE
9780769538730
Wu, Chunguo; Marchese, Maurizio; Wang, Yufei; Krapivin, Mikalai; Wang, Chaoyong; Li, Xitong; Liang, Yanchun
Data preprocessing in SVM-based keywords extraction from scientific documents / Wu, Chunguo; Marchese, Maurizio; Wang, Yufei; Krapivin, Mikalai; Wang, Chaoyong; Li, Xitong; Liang, Yanchun. - (2009), pp. 810-813. (Intervento presentato al convegno 2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009 tenutosi a Kaohsiung, Taiwan nel 2009) [10.1109/ICICIC.2009.155].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/189165
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? ND
social impact