In benchmarking studies with simulated data sets in which two or more statistical methods are compared, over and above the search of a universally winning method, one may investigate how the winning method may vary over patterns of characteristics of the data or the data-generating mechanism. Interestingly, this problem bears strong formal similarities to the problem of looking for optimal treatment regimes in biostatistics when two or more treatment alternatives are available for the same medical problem or disease. It is outlined how optimal data-analytic regimes, that is to say, rules for optimally calling in statistical methods, can be derived from benchmarking studies with simulated data by means of supervised classification methods (e.g., classification trees). The approach is illustrated by means of analyses of data from a benchmarking study to compare two different algorithms for the estimation of a two-mode additive clustering model. ©2016ElsevierB.V.Allrightsreserved

Deriving optimal data-analytic regimes from benchmarking studies / Doove, Lisa L.; Wilderjans, Tom F.; Calcagnì, Antonio; Van Mechelen, Iven. - In: COMPUTATIONAL STATISTICS & DATA ANALYSIS. - ISSN 0167-9473. - 107:(2017), pp. 81-91. [10.1016/j.csda.2016.10.016]

Deriving optimal data-analytic regimes from benchmarking studies

Calcagnì, Antonio;
2017-01-01

Abstract

In benchmarking studies with simulated data sets in which two or more statistical methods are compared, over and above the search of a universally winning method, one may investigate how the winning method may vary over patterns of characteristics of the data or the data-generating mechanism. Interestingly, this problem bears strong formal similarities to the problem of looking for optimal treatment regimes in biostatistics when two or more treatment alternatives are available for the same medical problem or disease. It is outlined how optimal data-analytic regimes, that is to say, rules for optimally calling in statistical methods, can be derived from benchmarking studies with simulated data by means of supervised classification methods (e.g., classification trees). The approach is illustrated by means of analyses of data from a benchmarking study to compare two different algorithms for the estimation of a two-mode additive clustering model. ©2016ElsevierB.V.Allrightsreserved
2017
Doove, Lisa L.; Wilderjans, Tom F.; Calcagnì, Antonio; Van Mechelen, Iven
Deriving optimal data-analytic regimes from benchmarking studies / Doove, Lisa L.; Wilderjans, Tom F.; Calcagnì, Antonio; Van Mechelen, Iven. - In: COMPUTATIONAL STATISTICS & DATA ANALYSIS. - ISSN 0167-9473. - 107:(2017), pp. 81-91. [10.1016/j.csda.2016.10.016]
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0167947316302432-main.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 996.19 kB
Formato Adobe PDF
996.19 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/167031
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 7
  • OpenAlex ND
social impact