Machine learning algorithms perform dierently in settings with varying levels of training set mislabeling noise. Therefore, the choice of a good algorithm for a particular learning problem is crucial. In this paper, we introduce the \Sigmoid Rule" Framework focusing on the de- scription of classier behavior in noisy settings. The framework uses an existing model of the expected performance of learning algorithms as a sigmoid function of the signal-to-noise ratio in the training instances. We study the parameters of the above sigmoid function using ve dierent classiers, namely, Naive Bayes, kNN, SVM, a decision tree classier, and a rule-based classier. Our study leads to the denition of intuitive criteria based on the sigmoid parameters that can be used to compare the behavior of learning algorithms in the presence of varying levels of noise. Furthermore, we show that there exists a connection between these parameters and the characteristics of the underlying dataset, hinting at how the inherent properties of a dataset aect learning. The framework is applicable to concept drift scenaria, including modeling user behavior over time, and mining of noisy data series, as in sensor networks.
Scheda prodotto non validato
I dati visualizzati non sono stati ancora sottoposti a validazione formale da parte dello Staff di IRIS, ma sono stati ugualmente trasmessi al Sito Docente Cineca (Loginmiur).
Titolo: | SRF: A Framework for the Study of Classifier Behavior under Training Set Mislabeling Noise. |
Autori: | Mirylenka, Katsiaryna; Giannakopoulos, George; Palpanas, Themistoklis |
Autori Unitn: | |
Autore/i del libro: | AA. VV. |
Titolo del volume contenente il saggio: | Proceedings of the International Working Conference on Advanced Visual Interfa |
Luogo di edizione: | Berlin |
Casa editrice: | Springer |
Anno di pubblicazione: | 2012 |
Codice identificativo Scopus: | 2-s2.0-84861447252 |
ISBN: | 9783642302169 9783642302176 |
Handle: | http://hdl.handle.net/11572/91998 |
Appare nelle tipologie: | 04.1 Saggio in atti di convegno (Paper in proceedings) |