Facilitating standardized COVID-19 suspicion prediction based on computed tomography radiomics in a multi-demographic setting

Nagaraj, Yeshaswini; De Jonge, Gonda; Andreychenko, Anna; Presti, Gabriele; Fink, Matthias A.; Pavlov, Nikolay; Quattrocchi, Carlo C.; Morozov, Sergey; Veldhuis, Raymond; Oudkerk, Matthijs; Van Ooijen, Peter M. A.

doi:10.1007/s00330-022-08730-6

Objective: To develop an automatic COVID-19 Reporting and Data System (CO-RADS)–based classification in a multi-demographic setting. Methods: This multi-institutional review boards–approved retrospective study included 2720 chest CT scans (mean age, 58 years [range 18–100 years]) from Italian and Russian patients. Three board-certified radiologists from three countries assessed randomly selected subcohorts from each population and provided CO-RADS–based annotations. CT radiomic features were extracted from the selected subcohorts after preprocessing steps like lung lobe segmentation and automatic noise reduction. We compared three machine learning models, logistic regression (LR), multilayer perceptron (MLP), and random forest (RF) for the automated CO-RADS classification. Model evaluation was carried out in two scenarios, first, training on a mixed multi-demographic subcohort and testing on an independent hold-out dataset. In the second scenario, training was done on a single demography and externally validated on the other demography. Results: The overall inter-observer agreement for the CO-RADS scoring between the radiologists was substantial (k = 0.80). Irrespective of the type of validation test scenario, suspected COVID-19 CT scans were identified with an accuracy of 84%. SHapley Additive exPlanations (SHAP) interpretation showed that the “wavelet_(LH)_GLCM_Imc1” feature had a positive impact on COVID prediction both with and without noise reduction. The application of noise reduction improved the overall performance between the classifiers for all types. Conclusion: Using an automated model based on the COVID-19 Reporting and Data System (CO-RADS), we achieved clinically acceptable performance in a multi-demographic setting. This approach can serve as a standardized tool for automated COVID-19 assessment. Keypoints: • Automatic CO-RADS scoring of large-scale multi-demographic chest CTs with mean AUC of 0.93 ± 0.04. • Validation procedure resembles TRIPOD 2b and 3 categories, enhancing the quality of experimental design to test the cross-dataset domain shift between institutions aiding clinical integration. • Identification of COVID-19 pneumonia in the presence of community-acquired pneumonia and other comorbidities with an AUC of 0.92.

Facilitating standardized COVID-19 suspicion prediction based on computed tomography radiomics in a multi-demographic setting / Nagaraj, Y., de Jonge, G., Andreychenko, A., Presti, G., Fink, M.A., Pavlov, N., Quattrocchi, C.C., Morozov, S., Veldhuis, R., Oudkerk, M., van Ooijen, P.M.A.. - In: EUROPEAN RADIOLOGY. - ISSN 1432-1084. - 32:9(2022), pp. 6384-6396. [10.1007/s00330-022-08730-6]