On the Convergence of Protein Structure and Dynamics: Statistical Learning Studies of Pseudo Folding Pathways

IRIS

Many algorithms that attempt to predict proteins' native structure from sequence need to generate a large set of hypotheses in order to ensure that nearly correct structures are included, leading to the problem of assessing the quality of alternative 3D conformations. This problem has been mostly approached by focusing on the final 3D conformation, with machine learning techniques playing a leading role. We argue in this paper that additional information for recognising native-like structures can be obtained by regarding the final conformation as the result of a generative process reminiscent of the folding process that generates structures in nature. We introduce a coarse representation of protein pseudo-folding based on binary trees and introduce a kernel function for assessing their similarity. Kernel-based analysis techniques empirically demonstrate a significant correlation between information contained into pseudo-folding trees and features of native folds in a large and non-redu...

On the Convergence of Protein Structure and Dynamics: Statistical Learning Studies of Pseudo Folding Pathways

A. Vullo;Passerini, Andrea;P. Frasconi;F. Costa;G. Pollastri

2008-01-01

Abstract

Many algorithms that attempt to predict proteins' native structure from sequence need to generate a large set of hypotheses in order to ensure that nearly correct structures are included, leading to the problem of assessing the quality of alternative 3D conformations. This problem has been mostly approached by focusing on the final 3D conformation, with machine learning techniques playing a leading role. We argue in this paper that additional information for recognising native-like structures can be obtained by regarding the final conformation as the result of a generative process reminiscent of the folding process that generates structures in nature. We introduce a coarse representation of protein pseudo-folding based on binary trees and introduce a kernel function for assessing their similarity. Kernel-based analysis techniques empirically demonstrate a significant correlation between information contained into pseudo-folding trees and features of native folds in a large and non-redu...

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2008
			
	Titolo del volume (Proceedings title)
	
				Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics: 6th European Conference,  Proceedings
			
	Luogo di edizione (Place of publication)
	
				HEIDELBERGER PLATZ 3, D-14197 BERLIN, GERMANY
			
	Casa editrice (Publisher)
	
				Springer Verlag
			
	ISBN
	
				9783540787563
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-47249097119
			
	Codice WOS (WOS identifier)
	
				WOS:000254612300018
			
	Tutti gli autori
	
						A., Vullo; Passerini, Andrea; P., Frasconi; F., Costa; G., Pollastri
					
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/20312

Citazioni

ND

1

0

ND

social impact