Evaluating reproducibility of AI algorithms in digital pathology with DAPPER

Bizzego, Andrea; Bussola, Nicole; Chierici, Marco; Maggio, Valerio; Francescatto, Margherita; Cima, Luca; Cristoforetti, Marco; Jurman, Giuseppe; Furlanello, Cesare

doi:10.1371/journal.pcbi.1006269

Artificial Intelligence is exponentially increasing its impact on healthcare. As deep learning is mastering computer vision tasks, its application to digital pathology is natural, with the promise of aiding in routine reporting and standardizing results across trials. Deep learning features inferred from digital pathology scans can improve validity and robustness of current clinico-pathological features, up to identifying novel histological patterns, e.g., from tumor infiltrating lymphocytes. In this study, we examine the issue of evaluating accuracy of predictive models from deep learning features in digital pathology, as a hallmark of reproducibility. We introduce the DAPPER framework for validation based on a rigorous Data Analysis Plan derived from the FDA’s MAQC project, designed to analyze causes of variability in predictive biomarkers. We apply the framework to models that identify tissue of origin on 787 Whole Slide Images from the Genotype-Tissue Expression (GTEx) project. We test three different deep learning architectures (VGG, ResNet, Inception) as feature extractors and three classifiers (a fully connected multilayer network, Support Vector Machine and Random Forests) and work with four datasets (5, 10, 20 or 30 classes), for a total of 53,000 tiles at 512 × 512 resolution. We analyze accuracy and feature stability of the machine learning classifiers, also demonstrating the need for diagnostic tests (e.g., random labels) to identify selection bias and risks for reproducibility. Further, we use the deep features from the VGG model from GTEx on the KIMIA24 dataset for identification of slide of origin (24 classes), training a classifier on 1,060 annotated tiles and validating it on 265 unseen ones. The DAPPER software, including its deep learning pipeline and the Histological Imaging—Newsy Tiles (HINT) benchmark dataset derived from GTEx, is released as a basis for standardization and validation initiatives in AI for digital pathology.
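The random-labels diagnostic mentioned in the abstract can be illustrated with a minimal sketch. This is not the DAPPER implementation: the synthetic 2-D points stand in for deep features, a nearest-centroid rule stands in for the paper's classifiers, and every function name below is hypothetical. The idea it shows is only that a pipeline free of selection bias should score near chance once the labels are shuffled, and clearly above chance on the true labels.

```python
import random
from statistics import mean


def make_features(n_classes, per_class, rng):
    """Synthetic stand-in for deep features: one separated Gaussian blob per class."""
    X, y = [], []
    for c in range(n_classes):
        for _ in range(per_class):
            X.append((rng.gauss(10.0 * c, 1.0), rng.gauss(0.0, 1.0)))
            y.append(c)
    return X, y


def fit_centroids(X, y):
    """Nearest-centroid model (an assumption, not the paper's classifier)."""
    groups = {}
    for point, label in zip(X, y):
        groups.setdefault(label, []).append(point)
    return {
        label: (mean(p[0] for p in pts), mean(p[1] for p in pts))
        for label, pts in groups.items()
    }


def predict(centroids, point):
    """Return the label whose centroid is closest in squared Euclidean distance."""
    return min(
        centroids,
        key=lambda c: (point[0] - centroids[c][0]) ** 2
        + (point[1] - centroids[c][1]) ** 2,
    )


def holdout_accuracy(X, y, rng, test_fraction=0.2):
    """Accuracy on a random held-out split; the model only sees the training part."""
    idx = list(range(len(X)))
    rng.shuffle(idx)
    n_test = int(len(idx) * test_fraction)
    test, train = idx[:n_test], idx[n_test:]
    centroids = fit_centroids([X[i] for i in train], [y[i] for i in train])
    hits = sum(predict(centroids, X[i]) == y[i] for i in test)
    return hits / n_test


def random_label_check(X, y, repeats=10, seed=0):
    """Mean held-out accuracy with true labels and with shuffled labels."""
    rng = random.Random(seed)
    true_acc = mean(holdout_accuracy(X, y, rng) for _ in range(repeats))
    shuffled_acc = []
    for _ in range(repeats):
        y_shuffled = y[:]
        rng.shuffle(y_shuffled)  # break any link between features and labels
        shuffled_acc.append(holdout_accuracy(X, y_shuffled, rng))
    return true_acc, mean(shuffled_acc)


X, y = make_features(n_classes=5, per_class=40, rng=random.Random(42))
true_acc, random_acc = random_label_check(X, y)
print(f"true labels: {true_acc:.2f}  random labels: {random_acc:.2f}  chance: {1 / 5:.2f}")
```

If the shuffled-label accuracy were well above chance, that would suggest information leaking from the held-out data into model fitting, which is the kind of selection bias the diagnostic is meant to expose.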

Evaluating reproducibility of AI algorithms in digital pathology with DAPPER / Bizzego, Andrea; Bussola, Nicole; Chierici, Marco; Maggio, Valerio; Francescatto, Margherita; Cima, Luca; Cristoforetti, Marco; Jurman, Giuseppe; Furlanello, Cesare. - In: PLOS COMPUTATIONAL BIOLOGY. - ISSN 1553-7358. - 15:3(2019). [10.1371/journal.pcbi.1006269]

2019-01-01

Date of publication: 2019
Journal title: PLOS COMPUTATIONAL BIOLOGY
Issue number and part: 3
DOI: https://dx.doi.org/10.1371/journal.pcbi.1006269
PubMed identifier: 30917113
Scopus identifier: 2-s2.0-85065000125
WOS identifier: WOS:000463877900003
Appears in types: 03.1 Journal article

Files in this item:

File	Size	Format
journal.pcbi.1006269.pdf (open access; Description: Main article; Type: Publisher's version (publisher's layout); License: Creative Commons)	3.46 MB	Adobe PDF