In the last years, Genome-Wide Associations Studies (GWAS) found many variants associated with complex diseases. However, the biological and molecular links between these variants and phenotypes are still mostly unknown. Also, even if sample sizes are constantly increasing, the associated variants do not explain all the heritability estimated for many traits. Many hypotheses have been proposed to explain the problem: from variant-variant interactions, the effect of rare and ultra-rare coding variants and also technical biases related to sequencing or statistic on sexual chromosomes. In this thesis, we mainly explore the hypothesis of variant-variant interaction and, briefly, the rare coding variants hypothesis while also considering possible molecular effects like allele-specific expression and the effects of variants on protein interfaces. Some parts of the thesis are also devoted to explore the implementation of efficient computational tools to explore these effects and to perform scalable genotyping of germline single nucleotide polymorphisms (SNPs) in huge datasets. The main part of the thesis regards the development of a new resource to identify putative variant-variant interactions. In particular, we integrated ChIP-seq data from ENCODE, transcription factor binding motifs from several resources and genotype and transcript level data from GTeX and TCGA. This new dataset allows us to formalize new models, to make hypothesis and to find putative novel associations and interactions between (mainly non-coding) germline variants and phenotypes, like cancer-specific phenotypes. In particular, we focused on the characterization of breast cancer and Alzheimer’s Disease GWAS risk variants, looking for putative variants’ interactions. Recently, the study of rare variants has become feasible thanks to the biobanks that made available genotypes and clinical data of thousands of patients. We characterize and explore the possible effects of rare coding inherited polymorphisms on protein interfaces in the UKBioBank trying to understand if the change in structure of protein can be one of the causes of complex diseases. Another part of the thesis explores variants as causal molecular effect for allele-specific expression. In particular, we describe UTRs variants that can alter the post-transcriptional regulation in mRNA leading to a phenomenon where an allele is more expressed than the other. Finally, we show those variants can have prognostic significance in breast cancer. This thesis work introduces results and computational tools that can be useful to a broad community of researcher studying human polymorphisms effects.

Computational analysis of effects and interactions among human variants in complex diseases / Valentini, Samuel. - (2022 Oct 18), pp. 1-167. [10.15168/11572_354827]

Computational analysis of effects and interactions among human variants in complex diseases

Valentini, Samuel
2022-10-18

Abstract

In the last years, Genome-Wide Associations Studies (GWAS) found many variants associated with complex diseases. However, the biological and molecular links between these variants and phenotypes are still mostly unknown. Also, even if sample sizes are constantly increasing, the associated variants do not explain all the heritability estimated for many traits. Many hypotheses have been proposed to explain the problem: from variant-variant interactions, the effect of rare and ultra-rare coding variants and also technical biases related to sequencing or statistic on sexual chromosomes. In this thesis, we mainly explore the hypothesis of variant-variant interaction and, briefly, the rare coding variants hypothesis while also considering possible molecular effects like allele-specific expression and the effects of variants on protein interfaces. Some parts of the thesis are also devoted to explore the implementation of efficient computational tools to explore these effects and to perform scalable genotyping of germline single nucleotide polymorphisms (SNPs) in huge datasets. The main part of the thesis regards the development of a new resource to identify putative variant-variant interactions. In particular, we integrated ChIP-seq data from ENCODE, transcription factor binding motifs from several resources and genotype and transcript level data from GTeX and TCGA. This new dataset allows us to formalize new models, to make hypothesis and to find putative novel associations and interactions between (mainly non-coding) germline variants and phenotypes, like cancer-specific phenotypes. In particular, we focused on the characterization of breast cancer and Alzheimer’s Disease GWAS risk variants, looking for putative variants’ interactions. Recently, the study of rare variants has become feasible thanks to the biobanks that made available genotypes and clinical data of thousands of patients. We characterize and explore the possible effects of rare coding inherited polymorphisms on protein interfaces in the UKBioBank trying to understand if the change in structure of protein can be one of the causes of complex diseases. Another part of the thesis explores variants as causal molecular effect for allele-specific expression. In particular, we describe UTRs variants that can alter the post-transcriptional regulation in mRNA leading to a phenomenon where an allele is more expressed than the other. Finally, we show those variants can have prognostic significance in breast cancer. This thesis work introduces results and computational tools that can be useful to a broad community of researcher studying human polymorphisms effects.
18-ott-2022
XXXIV
2021-2022
CIBIO (29/10/12-)
Biomolecular Sciences
Romanel, Alessandro
no
Inglese
Settore BIO/11 - Biologia Molecolare
File in questo prodotto:
File Dimensione Formato  
phd_unitn_samuel_valentini.pdf

accesso aperto

Descrizione: Tesi di dottorato
Tipologia: Tesi di dottorato (Doctoral Thesis)
Licenza: Creative commons
Dimensione 8.2 MB
Formato Adobe PDF
8.2 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/354827
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact