The improvement of text categorization by statistical methods can be performed from two main directions, namely the feature selection and the evaluation of characteristic weights. In this paper, we propose an enhanced text categorization method based on a modified mutual information algorithm and evaluation algorithm of characteristic weights which improves both aspects. The proposed method is applied to the benchmark test set Reuters-21578 Top10 to examine its effectiveness. Numerical results show that the precision, the recall and the value of F1 of the proposed method are all superior to those of existing conventional methods.

Text Categorization Method Based on Improved Mutual Information and Characteristic Weights Evaluation Algorithms / Z., Pei; Marchese, Maurizio; X., Shi; Y., Liang. - 4:(2007), pp. 87-91. (Intervento presentato al convegno Fourth International Conference on Fuzzy Systems and Knowledge Discovery tenutosi a Haikou, China nel 24-27 Aug. 2007) [10.1109/FSKD.2007.559].

Text Categorization Method Based on Improved Mutual Information and Characteristic Weights Evaluation Algorithms

Marchese, Maurizio;
2007-01-01

Abstract

The improvement of text categorization by statistical methods can be performed from two main directions, namely the feature selection and the evaluation of characteristic weights. In this paper, we propose an enhanced text categorization method based on a modified mutual information algorithm and evaluation algorithm of characteristic weights which improves both aspects. The proposed method is applied to the benchmark test set Reuters-21578 Top10 to examine its effectiveness. Numerical results show that the precision, the recall and the value of F1 of the proposed method are all superior to those of existing conventional methods.
2007
Fuzzy Systems and Knowledge Discovery
NEW YORK
IEEE
978-0-7695-2874-8
0-7695-2874-0
Z., Pei; Marchese, Maurizio; X., Shi; Y., Liang
Text Categorization Method Based on Improved Mutual Information and Characteristic Weights Evaluation Algorithms / Z., Pei; Marchese, Maurizio; X., Shi; Y., Liang. - 4:(2007), pp. 87-91. (Intervento presentato al convegno Fourth International Conference on Fuzzy Systems and Knowledge Discovery tenutosi a Haikou, China nel 24-27 Aug. 2007) [10.1109/FSKD.2007.559].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/77980
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 1
social impact