The benefits of the Matthews correlation coefficient ({MCC}) over the diagnostic odds ratio ({DOR}) in binary classification assessment

IRIS

To assess the quality of a binary classification, researchers often take advantage of a four-entry contingency table called confusion matrix , containing true positives, true negatives, false positives, and false negatives. To recap the four values of a confusion matrix in a unique score, researchers and statisticians have developed several rates and metrics. In the past, several scientific studies already showed why the Matthews correlation coefficient (MCC) is more informative and trustworthy than confusion-entropy error, accuracy, F 1 score, bookmaker informedness, markedness, and balanced accuracy. In this study, we compare the MCC with the diagnostic odds ratio (DOR), a statistical rate employed sometimes in biomedical sciences. After examining the properties of the MCC and of the DOR, we describe the relationships between them, by also taking advantage of an innovative geometrical plot called confusion tetrahedron , presented here for the first time. We then report some use cases where the MCC and the DOR produce discordant outcomes, and explain why the Matthews correlation coefficient is more informative and reliable between the two. Our results can have a strong impact in computer science and statistics, because they clearly explain why the trustworthiness of the information provided by the Matthews correlation coefficient is higher than the one generated by the diagnostic odds ratio.

The benefits of the Matthews correlation coefficient ({MCC}) over the diagnostic odds ratio ({DOR}) in binary classification assessment / Chicco, D., Starovoitov, V., Jurman, G.. - In: IEEE ACCESS. - ISSN 2169-3536. - ELETTRONICO. - 9:(2021), pp. 47112-47124. [10.1109/access.2021.3068614]

The benefits of the Matthews correlation coefficient ({MCC}) over the diagnostic odds ratio ({DOR}) in binary classification assessment

Chicco, Davide^Primo;Starovoitov, Valery^Secondo;Jurman, Giuseppe^Ultimo

2021-01-01

Abstract

To assess the quality of a binary classification, researchers often take advantage of a four-entry contingency table called confusion matrix , containing true positives, true negatives, false positives, and false negatives. To recap the four values of a confusion matrix in a unique score, researchers and statisticians have developed several rates and metrics. In the past, several scientific studies already showed why the Matthews correlation coefficient (MCC) is more informative and trustworthy than confusion-entropy error, accuracy, F 1 score, bookmaker informedness, markedness, and balanced accuracy. In this study, we compare the MCC with the diagnostic odds ratio (DOR), a statistical rate employed sometimes in biomedical sciences. After examining the properties of the MCC and of the DOR, we describe the relationships between them, by also taking advantage of an innovative geometrical plot called confusion tetrahedron , presented here for the first time. We then report some use cases where the MCC and the DOR produce discordant outcomes, and explain why the Matthews correlation coefficient is more informative and reliable between the two. Our results can have a strong impact in computer science and statistics, because they clearly explain why the trustworthiness of the information provided by the Matthews correlation coefficient is higher than the one generated by the diagnostic odds ratio.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2021
			
	Titolo del periodico (Journal title)
	
				IEEE ACCESS
			
	DOI
	
				https://dx.doi.org/10.1109/access.2021.3068614
			
	Codice Scopus (Scopus identifier)
	
				2-s2.0-85103289007
			
	Codice WOS (WOS identifier)
	
				WOS:000637189700001
			
	Tutti gli autori
	
						Chicco, Davide; Starovoitov, Valery; Jurman, Giuseppe
					
	Citazione
	
				The benefits of the Matthews correlation coefficient ({MCC}) over the diagnostic odds ratio ({DOR}) in binary classification assessment / Chicco, D., Starovoitov, V., Jurman, G.. - In: IEEE ACCESS. - ISSN 2169-3536. - ELETTRONICO. - 9:(2021), pp. 47112-47124. [10.1109/access.2021.3068614]
			
	Appare nelle tipologie:
	
				03.1 Articolo su rivista (Journal article)

File in questo prodotto:

File	Dimensione	Formato
The_Benefits_of_the_Matthews_Correlation_Coefficient_MCC_Over_the_Diagnostic_Odds_Ratio_DOR_in_Binary_Classification_Assessment.pdf accesso aperto Tipologia: Versione editoriale (Publisher’s layout) Licenza: Creative commons Dimensione 3.25 MB Formato Adobe PDF Visualizza/Apri	3.25 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/423590

Citazioni

ND

98

74

110

social impact