One of the aims of modern bioinformatics is to discover the molecular mechanisms which rule the protein operation. This is a fundamental step in understanding the complex processes involved in living systems and would possibly allow us to correct dysfunctions. A protein may have different functions. Such functions are determined by the protein primary structure, i.e. the sequence of amino acids that constitute it, and by their spatial disposition (tertiary structure). Protein function identification is a challenging problem as it involves the combination of a large number of variables, most of which still unknown. Therefore approaches for the automatic detection of protein functional sites are needed. In this work we concentrate on the prediction of functional residues, i.e. residues which directly interact with the substrate. In its simpler formulation, the problem can be cast into a binary classification task at the residue level. Preliminary experiments showed that evolutionary enriched sequence-based information alone achieve performance which are statistically indistinguishable from carefully crafted features extracted from 3D coordinates. While allowing to apply functional residue prediction to the much wider range of sequenced proteins with possibly unknown 3D structure, such results indicate that using structural information in the automatic prediction of protein functional aspects is a non-trivial task. We show that by modeling physico-chemical properties of the residue structural neighbourhood we obtain significant improvements, but further research is needed in order to fully exploit the information provided by the protein 3D structure.
Automatic Prediction of Functional Residues from Sequence and Structural Information / Cilia, Elisa; Brunato, Mauro; Passerini, Andrea. - ELETTRONICO. - (2008), pp. 1-10.
Automatic Prediction of Functional Residues from Sequence and Structural Information
Cilia, Elisa;Brunato, Mauro;Passerini, Andrea
2008-01-01
Abstract
One of the aims of modern bioinformatics is to discover the molecular mechanisms which rule the protein operation. This is a fundamental step in understanding the complex processes involved in living systems and would possibly allow us to correct dysfunctions. A protein may have different functions. Such functions are determined by the protein primary structure, i.e. the sequence of amino acids that constitute it, and by their spatial disposition (tertiary structure). Protein function identification is a challenging problem as it involves the combination of a large number of variables, most of which still unknown. Therefore approaches for the automatic detection of protein functional sites are needed. In this work we concentrate on the prediction of functional residues, i.e. residues which directly interact with the substrate. In its simpler formulation, the problem can be cast into a binary classification task at the residue level. Preliminary experiments showed that evolutionary enriched sequence-based information alone achieve performance which are statistically indistinguishable from carefully crafted features extracted from 3D coordinates. While allowing to apply functional residue prediction to the much wider range of sequenced proteins with possibly unknown 3D structure, such results indicate that using structural information in the automatic prediction of protein functional aspects is a non-trivial task. We show that by modeling physico-chemical properties of the residue structural neighbourhood we obtain significant improvements, but further research is needed in order to fully exploit the information provided by the protein 3D structure.File | Dimensione | Formato | |
---|---|---|---|
Cilia_TR.pdf
accesso aperto
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
2.13 MB
Formato
Adobe PDF
|
2.13 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione