Background: Predicting prognosis in people with multiple sclerosis (pwMS) at early disease stages still remains an unmet need. Machine learning (ML) strategies demonstrated good reliability when applied for prediction in medicine. This study aimed at developing a predictive algorithm comparing different ML approaches, by using routine demographic, clinical and radiological data from a large multicentric cohort of newly diagnosed pwMS. Methods: Demographic, clinical, radiological and biochemical data were retrospectively collected at three Italian MS centers at baseline and four timepoints thereafter (6, 12, 24, and 36 months). Data from the first evaluation and subsequent 2-year follow-up were analyzed, comparing different ML models (Random Forest, Extra Trees, XGBoost, Logistic Regression and Support Vector Classifier) to predict progression independent of relapse activity (PIRA) at year 3. To understand how features impacted the selected model's output, a ML explainability analysis was performed on the whole cohort and on specific subsets of patients, those aged under 45 and those NEDA-3 at the 2-year follow-up. Results: Data from 719 pwMS (age 34.6 ± 11.2 years); female sex 501 (70%) were analyzed. Ninety-two pwMS (13%) developed PIRA at year 3. Random Forest achieved the highest score, with a test set area under the ROC curve (AUC) of 0.75 ± 0.06. Features with the highest predictive impact were Expanded Disability Status Scale at 24 months, age at symptom onset and disease duration at baseline. Conclusion: Our results showed the feasibility of applying ML techniques to predict short-term PIRA in newly diagnosed pwMS by using routine clinical practice data, paving the way for tailored and personalized approaches.

Machine Learning Analysis Applied to Prediction of Early Progression Independent of Relapse Activity in Multiple Sclerosis Patients / Poretto, Valentina; Endrizzi, Walter; Betti, Matteo; Bovo, Stefano; Bellinvia, Angelo; Ragni, Flavio; Lapucci, Caterina; Moroni, Monica; Marangoni, Sabrina; Portaccio, Emilio; Longo, Chiara; Gios, Lorenzo; Chierici, Marco; Jurman, Giuseppe; Giometto, Bruno; Inglese, Matilde; Osmani, Venet; Marenco, Manuela; Uccelli, Antonio; Amato, Maria Pia. - In: EUROPEAN JOURNAL OF NEUROLOGY. - ISSN 1351-5101. - 32:12(2025), pp. e70417.01-e70417.09. [10.1111/ene.70417]

Machine Learning Analysis Applied to Prediction of Early Progression Independent of Relapse Activity in Multiple Sclerosis Patients

Endrizzi, Walter;Bovo, Stefano;Ragni, Flavio;Moroni, Monica;Longo, Chiara;Chierici, Marco;Jurman, Giuseppe;Giometto, Bruno;Osmani, Venet;
2025-01-01

Abstract

Background: Predicting prognosis in people with multiple sclerosis (pwMS) at early disease stages still remains an unmet need. Machine learning (ML) strategies demonstrated good reliability when applied for prediction in medicine. This study aimed at developing a predictive algorithm comparing different ML approaches, by using routine demographic, clinical and radiological data from a large multicentric cohort of newly diagnosed pwMS. Methods: Demographic, clinical, radiological and biochemical data were retrospectively collected at three Italian MS centers at baseline and four timepoints thereafter (6, 12, 24, and 36 months). Data from the first evaluation and subsequent 2-year follow-up were analyzed, comparing different ML models (Random Forest, Extra Trees, XGBoost, Logistic Regression and Support Vector Classifier) to predict progression independent of relapse activity (PIRA) at year 3. To understand how features impacted the selected model's output, a ML explainability analysis was performed on the whole cohort and on specific subsets of patients, those aged under 45 and those NEDA-3 at the 2-year follow-up. Results: Data from 719 pwMS (age 34.6 ± 11.2 years); female sex 501 (70%) were analyzed. Ninety-two pwMS (13%) developed PIRA at year 3. Random Forest achieved the highest score, with a test set area under the ROC curve (AUC) of 0.75 ± 0.06. Features with the highest predictive impact were Expanded Disability Status Scale at 24 months, age at symptom onset and disease duration at baseline. Conclusion: Our results showed the feasibility of applying ML techniques to predict short-term PIRA in newly diagnosed pwMS by using routine clinical practice data, paving the way for tailored and personalized approaches.
2025
12
Poretto, Valentina; Endrizzi, Walter; Betti, Matteo; Bovo, Stefano; Bellinvia, Angelo; Ragni, Flavio; Lapucci, Caterina; Moroni, Monica; Marangoni, Sa...espandi
Machine Learning Analysis Applied to Prediction of Early Progression Independent of Relapse Activity in Multiple Sclerosis Patients / Poretto, Valentina; Endrizzi, Walter; Betti, Matteo; Bovo, Stefano; Bellinvia, Angelo; Ragni, Flavio; Lapucci, Caterina; Moroni, Monica; Marangoni, Sabrina; Portaccio, Emilio; Longo, Chiara; Gios, Lorenzo; Chierici, Marco; Jurman, Giuseppe; Giometto, Bruno; Inglese, Matilde; Osmani, Venet; Marenco, Manuela; Uccelli, Antonio; Amato, Maria Pia. - In: EUROPEAN JOURNAL OF NEUROLOGY. - ISSN 1351-5101. - 32:12(2025), pp. e70417.01-e70417.09. [10.1111/ene.70417]
File in questo prodotto:
File Dimensione Formato  
ENE-32-e70417.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 1.3 MB
Formato Adobe PDF
1.3 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/480140
Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
  • OpenAlex 1
social impact