With the recent developments in the field of multi-omics integration, the interest in factors such as data preprocessing, choice of the integration method and the number of different omics considered had increased. In this work, the impact of these factors is explored when solving the problem of sample classification, by comparing the performances of five unsupervised algorithms: Multiple Canonical Correlation Analysis, Multiple Co-Inertia Analysis, Multiple Factor Analysis, Joint and Individual Variation Explained and Similarity Network Fusion. These methods were applied to three real data sets taken from literature and several ad hoc simulated scenarios to discuss classification performance in different conditions of noise and signal strength across the data types. The impact of experimental design, feature selection and parameter training has been also evaluated to unravel important conditions that can affect the accuracy of the result.

Multi-omics integration-a comparison of unsupervised clustering methodologies / Tini, Giulia; Marchetti, Luca; Priami, Corrado; Scott-Boyer, Marie-Pier. - In: BRIEFINGS IN BIOINFORMATICS. - ISSN 1467-5463. - 20:4(2019), pp. 1269-1279. [10.1093/bib/bbx167]

Multi-omics integration-a comparison of unsupervised clustering methodologies

Tini, Giulia;Marchetti, Luca;Priami, Corrado;Scott-Boyer, Marie-Pier
2019-01-01

Abstract

With the recent developments in the field of multi-omics integration, the interest in factors such as data preprocessing, choice of the integration method and the number of different omics considered had increased. In this work, the impact of these factors is explored when solving the problem of sample classification, by comparing the performances of five unsupervised algorithms: Multiple Canonical Correlation Analysis, Multiple Co-Inertia Analysis, Multiple Factor Analysis, Joint and Individual Variation Explained and Similarity Network Fusion. These methods were applied to three real data sets taken from literature and several ad hoc simulated scenarios to discuss classification performance in different conditions of noise and signal strength across the data types. The impact of experimental design, feature selection and parameter training has been also evaluated to unravel important conditions that can affect the accuracy of the result.
2019
4
Tini, Giulia; Marchetti, Luca; Priami, Corrado; Scott-Boyer, Marie-Pier
Multi-omics integration-a comparison of unsupervised clustering methodologies / Tini, Giulia; Marchetti, Luca; Priami, Corrado; Scott-Boyer, Marie-Pier. - In: BRIEFINGS IN BIOINFORMATICS. - ISSN 1467-5463. - 20:4(2019), pp. 1269-1279. [10.1093/bib/bbx167]
File in questo prodotto:
File Dimensione Formato  
Tini_Multi-omics integration.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 970.5 kB
Formato Adobe PDF
970.5 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/326038
Citazioni
  • ???jsp.display-item.citation.pmc??? 35
  • Scopus 87
  • ???jsp.display-item.citation.isi??? 25
social impact