Scientific documents are unstructured data consisting of natural language and hard for scientists to read and manage. Keywords are very helpful for scientists to search the related documents and know about their contents in a prompt way. In this paper we investigate a kind of data preprocessing technique used in SVM-based keyword extraction from scientific documents. Four definitions of regular scientific documents are proposed, and the analysis on the experimental results is performed based on the proposed definitions. The experimental results confirm the intuition that abstract is important for keywords extraction. © 2009 IEEE.
Data preprocessing in SVM-based keywords extraction from scientific documents / Wu, Chunguo; Marchese, Maurizio; Wang, Yufei; Krapivin, Mikalai; Wang, Chaoyong; Li, Xitong; Liang, Yanchun. - (2009), pp. 810-813. ((Intervento presentato al convegno 2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009 tenutosi a Kaohsiung, Taiwan nel 2009.
Scheda prodotto non validato
I dati visualizzati non sono stati ancora sottoposti a validazione formale da parte dello Staff di IRIS, ma sono stati ugualmente trasmessi al Sito Docente Cineca (Loginmiur).
Titolo: | Data preprocessing in SVM-based keywords extraction from scientific documents |
Autori: | Chunguo, Wu; Marchese, Maurizio; Wang, Yufei; Krapivin, Mikalai; Wang, Chaoyong; Xitong, Li; Liang, Yanchun |
Autori Unitn: | |
Titolo del volume contenente il saggio: | 2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009 |
Luogo di edizione: | NEW YORK |
Casa editrice: | IEEE |
Anno di pubblicazione: | 2009 |
Codice identificativo Scopus: | 2-s2.0-77951488374 |
ISBN: | 9780769538730 |
Handle: | http://hdl.handle.net/11572/189165 |
Citazione: | Data preprocessing in SVM-based keywords extraction from scientific documents / Wu, Chunguo; Marchese, Maurizio; Wang, Yufei; Krapivin, Mikalai; Wang, Chaoyong; Li, Xitong; Liang, Yanchun. - (2009), pp. 810-813. ((Intervento presentato al convegno 2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009 tenutosi a Kaohsiung, Taiwan nel 2009. |
Appare nelle tipologie: |