In this paper a differential evolution (DE)-based feature selection technique is developed for anaphora resolution in a resource-poor language, namely Bengali. We discuss the issues of adapting a state-of-the-art English anaphora resolution system for a resource-poor language like Bengali. Performance of any anaphoric resolver greatly depends on the quality of a high accurate mention detector and the use of appropriate features for anaphora resolution. We develop a number of models for mention detection based on machine learning and heuristics. In anaphora resolution there is no globally accepted metric for measuring the performance, and each of them such as MUC, $$hbox {B}^{3}$$B3, CEAF, Blanc exhibit significantly different behaviors. Our proposed feature selection technique determines the near-optimal feature set by optimizing each of these evaluation metrics. Experiments show how a language-dependent system (designed primarily for English) can attain reasonably good performance level when re-trained and tested on a new language with a proper subset of features. Evaluation results yield the F-measure values of 66.70, 59.47, 51.56, 33.08 and 72.75 % for MUC, B3, CEAFM, CEAFE and BLANC, respectively.

Differential evolution-based feature selection technique for anaphora resolution / Sikdar, U. K.; Ekbal, A.; Saha, S.; Uryupina, O.; Poesio, M.. - In: SOFT COMPUTING. - ISSN 1432-7643. - STAMPA. - 19:8(2015), pp. 2149-2161. [10.1007/s00500-014-1397-3]

Differential evolution-based feature selection technique for anaphora resolution

Saha S.;Uryupina O.;Poesio M.
2015-01-01

Abstract

In this paper a differential evolution (DE)-based feature selection technique is developed for anaphora resolution in a resource-poor language, namely Bengali. We discuss the issues of adapting a state-of-the-art English anaphora resolution system for a resource-poor language like Bengali. Performance of any anaphoric resolver greatly depends on the quality of a high accurate mention detector and the use of appropriate features for anaphora resolution. We develop a number of models for mention detection based on machine learning and heuristics. In anaphora resolution there is no globally accepted metric for measuring the performance, and each of them such as MUC, $$hbox {B}^{3}$$B3, CEAF, Blanc exhibit significantly different behaviors. Our proposed feature selection technique determines the near-optimal feature set by optimizing each of these evaluation metrics. Experiments show how a language-dependent system (designed primarily for English) can attain reasonably good performance level when re-trained and tested on a new language with a proper subset of features. Evaluation results yield the F-measure values of 66.70, 59.47, 51.56, 33.08 and 72.75 % for MUC, B3, CEAFM, CEAFE and BLANC, respectively.
2015
8
Sikdar, U. K.; Ekbal, A.; Saha, S.; Uryupina, O.; Poesio, M.
Differential evolution-based feature selection technique for anaphora resolution / Sikdar, U. K.; Ekbal, A.; Saha, S.; Uryupina, O.; Poesio, M.. - In: SOFT COMPUTING. - ISSN 1432-7643. - STAMPA. - 19:8(2015), pp. 2149-2161. [10.1007/s00500-014-1397-3]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/295810
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 24
  • ???jsp.display-item.citation.isi??? 21
social impact