Thesis Details
Discovering Protein Sequence-Structure Motifs and Two Applications to Structural Prediction
Tang, Thomas Cheuk Kai
University of Waterloo
Keywords: Computer Science; Bioinformatics; Data mining; Clustering; Sequential and Structural Motif Discovery; Secondary Structure Prediction; Local Tertiary Structure Prediction; SVM
Others  :  https://uwspace.uwaterloo.ca/bitstream/10012/1188/1/tcktang2004.pdf
Switzerland | English
Source: UWSPACE Waterloo Institutional Repository
【 Abstract 】

This thesis investigates the correlations between short protein peptide sequences and local tertiary structures. In particular, it introduces a novel algorithm for partitioning short protein segments into clusters of local sequence-structure motifs, and demonstrates that these motif clusters contain useful structural information via two applications to structural prediction. The first application utilizes motif clusters to predict local protein tertiary structures. A novel dynamic programming algorithm that performs comparably with some of the best existing algorithms is described. The second application exploits the capability of motif clusters in recognizing regular secondary structures to improve the performance of secondary structure prediction based on Support Vector Machines. Empirical results show significant improvement in overall prediction accuracy with no performance degradation in any specific aspect being measured. The encouraging results obtained illustrate the great potential of using local sequence-structure motifs to tackle protein structure predictions and possibly other important problems in computational biology.

【 Preview 】
Attachments
Files Size Format View
Discovering Protein Sequence-Structure Motifs and Two Applications to Structural Prediction 1426KB PDF download