Graphical lasso is one of the most used estimators for inferring genetic networks. Despite its diffusion, there are several fields in applied research where the limits of detection of modern measurement technologies make the use of this estimator theoretically unfounded, even when the assumption of a multivariate Gaussian distribution is satisfied. Typical examples are data generated by polymerase chain reactions and flow cytometer. The combination of censoring and high-dimensionality make inference of the underlying genetic networks from these data very challenging. In this article, we propose an $ell_1$-penalized Gaussian graphical model for censored data and derive two EM-like algorithms for inference. We evaluate the computational efficiency of the proposed algorithms by an extensive simulation study and show that, when censored data are available, our proposal is superior to existing competitors both in terms of network recovery and parameter estimation. We apply the proposed method to gene expression data generated by microfluidic Reverse Transcription quantitative Polymerase Chain Reaction technology in order to make inference on the regulatory mechanisms of blood development. A software implementation of our method is available on github (https://github.com/LuigiAugugliaro/cglasso).

ℓ1-Penalized censored Gaussian graphical model / Augugliaro, L.; Abbruzzo, A.; Vinciotti, V.. - In: BIOSTATISTICS. - ISSN 1468-4357. - 21:2(2020), pp. e1-e16. [10.1093/biostatistics/kxy043]

ℓ1-Penalized censored Gaussian graphical model

Vinciotti V.
2020-01-01

Abstract

Graphical lasso is one of the most used estimators for inferring genetic networks. Despite its diffusion, there are several fields in applied research where the limits of detection of modern measurement technologies make the use of this estimator theoretically unfounded, even when the assumption of a multivariate Gaussian distribution is satisfied. Typical examples are data generated by polymerase chain reactions and flow cytometer. The combination of censoring and high-dimensionality make inference of the underlying genetic networks from these data very challenging. In this article, we propose an $ell_1$-penalized Gaussian graphical model for censored data and derive two EM-like algorithms for inference. We evaluate the computational efficiency of the proposed algorithms by an extensive simulation study and show that, when censored data are available, our proposal is superior to existing competitors both in terms of network recovery and parameter estimation. We apply the proposed method to gene expression data generated by microfluidic Reverse Transcription quantitative Polymerase Chain Reaction technology in order to make inference on the regulatory mechanisms of blood development. A software implementation of our method is available on github (https://github.com/LuigiAugugliaro/cglasso).
2020
2
Augugliaro, L.; Abbruzzo, A.; Vinciotti, V.
ℓ1-Penalized censored Gaussian graphical model / Augugliaro, L.; Abbruzzo, A.; Vinciotti, V.. - In: BIOSTATISTICS. - ISSN 1468-4357. - 21:2(2020), pp. e1-e16. [10.1093/biostatistics/kxy043]
File in questo prodotto:
File Dimensione Formato  
Augugliaro_BIOSTS_2020.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 313.55 kB
Formato Adobe PDF
313.55 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/276004
Citazioni
  • ???jsp.display-item.citation.pmc??? 3
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 4
social impact