学位论文详细信息
Efficient algorithm for selecting protein residue-residue contacts
Protein Contacts;Protein Structure;Contact Selection;Protein Folding;Integer Programming
Ye, Qing ; Peng ; Jian
关键词: Protein Contacts;    Protein Structure;    Contact Selection;    Protein Folding;    Integer Programming;   
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/104910/YE-THESIS-2019.pdf?sequence=1&isAllowed=y
美国|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

The functions of proteins are largely determined by their structures. Determination of the protein three-dimensional structure is experimentally and computationally challenging. Since amino acids residues that are spatially close often co-evolve, the correlation allows us to predict the contacts from multiple sequence alignments. The predicted contacts can then be used as spatial constraints and offer guidance in protein structure prediction. The constraints can be used as inputs to a protein structure prediction algorithm to produce "decoy" models as tentative 3D structures for proteins. However, the computation power required for structural prediction grows exponentially with respect to the number of contacts selected. Thus selecting few and yet informative contacts are essential for producing high-quality models quickly. Existing contact prediction methods aim for improving precision and recall. However, not all contacts offer the same level of structural information in terms of structure prediction. Therefore, the strategy to select contacts of highest confidence may not be ideal for structure prediction. Here we present an efficient algorithm, ContactSel, to select contacts for assisting contact-guided ab inito folding. We take the key idea that contacts that involve residues far apart (long-ranged) and collections of contacts that are most diverse contains more information than contacts that are shorter ranged and closed by. We formulate the contact selection problem into an integer programming algorithm to select structurally diverse contacts.For evaluation, we generated decoy models using L/2 contacts selected by ContactSel and a naive selection baseline. We show that we achieved significant improvement on the CASP 12 domain set.

【 预 览 】
附件列表
Files Size Format View
Efficient algorithm for selecting protein residue-residue contacts 3873KB PDF download
  文献评价指标  
  下载次数:19次 浏览次数:44次