Informatics in Medicine Unlocked | |
DeepDBP: Deep neural networks for identification of DNA-binding proteins | |
Swakkhar Shatabda1  Sheikh Adilina2  Md Tawab Alam Khan3  Nazia Afrin Neezi3  Shadman Shadab3  | |
[1] Department of Computer Science and Engineering, United International University, Plot-2, United City, Madani Avenue, Badda, Dhaka, 1212, Bangladesh;Corresponding author.;Department of Computer Science and Engineering, United International University, Plot-2, United City, Madani Avenue, Badda, Dhaka, 1212, Bangladesh; | |
关键词: DNA-Binding proteins; Deep learning; CNN; Classification algorithm; Feature; Selection; | |
DOI : | |
来源: DOAJ |
【 摘 要 】
DNA-Binding proteins (DBP) are associated with many cellular level functions which includes but not limited to body's defense mechanism and oxygen transportation. They bind DNAs and interact with them. In the past DBPs were identified using experimental lab based methods. However, in the recent years researchers are using supervised learning to identify DBPs solely from protein sequences. In this paper, we apply deep learning methods to identify DBPs. We have proposed two different deep learning based methods for identifying DBPs: DeepDBP-ANN and DeepDBP-CNN. DeepDBP-ANN uses a generated set of features trained on traditional neural network and DeepDBP-CNN uses a pre-learned embedding and Convolutional Neural Network. Both of our proposed methods were able to produce state-of-the-art results when tested on standard benchmark datasets. DeepDBP-ANN had a train accuracy of 99.02% and test accuracy of 82.80%.And DeepDBP-CNN though had train accuracy of 94.32%, it excelled at identifying test instances with 84.31% accuracy. All methods are available codes and methods are available for use at: https://github.com/antorkhan/DNABinding.
【 授权许可】
Unknown