Journal Article Details
Evolutionary Bioinformatics
Profiles and Majority Voting-Based Ensemble Method for Protein Secondary Structure Prediction
Hafida Bouziane
Keywords: protein secondary structure prediction; k-Nearest Neighbors; feed-forward Neural Networks; Multi-class Support Vector Machines (M-SVMs); ensemble method; Position-Specific Scoring Matrix (PSSM) profiles
DOI: 10.4137/EBO.S7931
Subject classification: Biotechnology
Source: Sage Journals
【 Abstract 】

Machine learning techniques have been widely applied to the problem of predicting protein secondary structure from the amino acid sequence, and they have achieved substantial success in this research area. Many methods have been used, including k-Nearest Neighbors (k-NNs), Hidden Markov Models (HMMs), Artificial Neural Networks (ANNs) and Support Vector Machines (SVMs), which have attracted attention recently. Today, the main goal remains to improve the prediction quality of the secondary structure elements. Prediction accuracy has improved continuously over the years, especially through hybrid or ensemble methods and the incorporation of evolutionary information in the form of profiles extracted from alignments of multiple homologous sequences. In this paper, we investigate how best to combine k-NNs, ANNs and Multi-class SVMs (M-SVMs) to improve secondary structure prediction of globular proteins. An ensemble method that combines the outputs of two feed-forward ANNs, a k-NN and three M-SVM classifiers has been applied. Ensemble members are combined using two variants of the majority voting rule. A heuristic-based filter has also been applied to refine the prediction. To investigate how much improvement the general ensemble method can give over the individual classifiers that make up the ensemble, we have evaluated the proposed system on the two widely used benchmark datasets RS126 and CB513 using cross-validation tests, with PSI-BLAST position-specific scoring matrix (PSSM) profiles as inputs. The experimental results reveal that the proposed system yields significant performance gains when compared with the best individual classifier.
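
As an illustration of the combination step mentioned in the abstract, the following minimal Python sketch applies a per-residue majority vote to the three-state predictions of several classifiers. The abstract does not specify the two voting variants, so the plain and weighted forms shown here are assumptions, as are the function name `majority_vote`, the state alphabet `H`/`E`/`C`, the optional `weights` argument, and the tie-breaking rule. It should not be read as the paper's actual implementation.

```python
# Hypothetical sketch of per-residue majority voting over classifier outputs.
# Assumption: each classifier emits one label per residue from H (helix),
# E (strand) or C (coil). The paper's own voting variants are not described
# in the abstract and may differ from what is shown here.

STATES = ("H", "E", "C")


def majority_vote(predictions, weights=None):
    """Return the consensus string for a list of equal-length predictions.

    predictions: list of strings over STATES, one string per classifier.
    weights: optional per-classifier weights (an assumed variant, for
             example validation accuracies). Defaults to one vote each.
    Ties go to the state that appears first in STATES (an arbitrary,
    assumed rule).
    """
    if not predictions:
        raise ValueError("at least one prediction is required")
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all predictions must have the same length")
    if weights is None:
        weights = [1.0] * len(predictions)
    if len(weights) != len(predictions):
        raise ValueError("one weight per classifier is required")

    consensus = []
    for i in range(length):
        # Accumulate the (weighted) votes cast for each state at residue i.
        score = {state: 0.0 for state in STATES}
        for pred, weight in zip(predictions, weights):
            score[pred[i]] += weight
        # max() returns the first maximal element, which gives the tie rule.
        consensus.append(max(STATES, key=lambda state: score[state]))
    return "".join(consensus)


if __name__ == "__main__":
    # Six made-up classifier outputs for a six-residue fragment.
    outputs = ["HHHECC", "HHEECC", "HHHECC", "CHHEEC", "HHHCCC", "HEHECC"]
    print(majority_vote(outputs))
```

With unit weights this is plain plurality voting; passing `weights` lets a more reliable classifier outvote several weaker ones, which is one common way a second voting variant is defined.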

【 License 】

Unknown   
