期刊论文详细信息
BMC Bioinformatics
Identification of DNA-binding proteins using multi-features fusion and binary firefly optimization algorithm
Research Article
Bo Gao1  Haiting Chai1  Jian Zhang1  Zhiqiang Ma1  Guifu Yang2 
[1]School of Computer Science and Information Technology, Northeast Normal University, 130117, Changchun, People’s Republic of China
[2]School of Computer Science and Information Technology, Northeast Normal University, 130117, Changchun, People’s Republic of China
[3]Office of Informatization Management and Planning, Northeast Normal University, 130117, Changchun, People’s Republic of China
关键词: DNA-binding proteins;    Binary firefly algorithm;    Feature selection;    Parameters optimization;   
DOI  :  10.1186/s12859-016-1201-8
 received in 2016-01-21, accepted in 2016-08-24,  发布年份 2016
来源: Springer
PDF
【 摘 要 】
BackgroundDNA-binding proteins (DBPs) play fundamental roles in many biological processes. Therefore, the developing of effective computational tools for identifying DBPs is becoming highly desirable.ResultsIn this study, we proposed an accurate method for the prediction of DBPs. Firstly, we focused on the challenge of improving DBP prediction accuracy with information solely from the sequence. Secondly, we used multiple informative features to encode the protein. These features included evolutionary conservation profile, secondary structure motifs, and physicochemical properties. Thirdly, we introduced a novel improved Binary Firefly Algorithm (BFA) to remove redundant or noisy features as well as select optimal parameters for the classifier. The experimental results of our predictor on two benchmark datasets outperformed many state-of-the-art predictors, which revealed the effectiveness of our method. The promising prediction performance on a new-compiled independent testing dataset from PDB and a large-scale dataset from UniProt proved the good generalization ability of our method. In addition, the BFA forged in this research would be of great potential in practical applications in optimization fields, especially in feature selection problems.ConclusionsA highly accurate method was proposed for the identification of DBPs. A user-friendly web-server named iDbP (identification of DNA-binding Proteins) was constructed and provided for academic use.
【 授权许可】

CC BY   
© The Author(s). 2016

【 预 览 】
附件列表
Files Size Format View
RO202311093549300ZK.pdf 1950KB PDF download
12864_2017_3487_Article_IEq8.gif 1KB Image download
12896_2015_228_Article_IEq1.gif 1KB Image download
12864_2017_3938_Article_IEq7.gif 1KB Image download
【 图 表 】

12864_2017_3938_Article_IEq7.gif

12896_2015_228_Article_IEq1.gif

12864_2017_3487_Article_IEq8.gif

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  • [39]
  • [40]
  • [41]
  • [42]
  • [43]
  • [44]
  • [45]
  • [46]
  • [47]
  • [48]
  • [49]
  • [50]
  • [51]
  • [52]
  • [53]
  文献评价指标  
  下载次数:0次 浏览次数:1次