BMC Bioinformatics
Prediction of aptamer-protein interacting pairs using an ensemble classifier in combination with various protein sequence attributes
Research Article
Lina Zhang [1], Runtao Yang [1], Rui Gao [1], Chengjin Zhang [2], Qing Song [3]
[1] School of Control Science and Engineering, Shandong University, Jingshi Road No.17923, 250061, Jinan, China; [2] School of Mechanical, Electrical and Information Engineering, Shandong University at Weihai, Wenhuaxi Road No.180, 264209, Weihai, China; [3] School of Electrical Engineering, University of Jinan, Nanxinzhuangxi Road No.336, 250022, Jinan, China
Keywords: Aptamer-protein interacting pairs; Ensemble method; Hybrid features; Imbalanced data problem
DOI: 10.1186/s12859-016-1087-5
Received 2016-01-07, accepted 2016-05-17, published 2016
Source: Springer
【 Abstract 】
Background: Aptamer-protein interacting pairs perform a variety of physiological functions and hold therapeutic potential in organisms. Rapid and effective prediction of aptamer-protein interacting pairs is important for designing aptamers that bind to specific proteins of interest, which in turn gives insight into the mechanisms of aptamer-protein interaction and supports the development of aptamer-based therapies.

Results: In this study, an ensemble method is presented to predict aptamer-protein interacting pairs using hybrid features. The aptamer features are extracted with Pseudo K-tuple Nucleotide Composition (PseKNC), while the protein features incorporate Discrete Cosine Transformation (DCT), disorder information, and bi-gram Position Specific Scoring Matrix (PSSM). We investigate the predictive capabilities of various feature spaces. Under 10-fold cross-validation, the proposed ensemble method obtains its best performance, a Youden’s Index of 0.380, with the hybrid feature space of PseKNC, DCT, bi-gram PSSM, and disorder information. The Relief-Incremental Feature Selection (IFS) method is then adopted to obtain the optimal feature set. Based on the optimal feature set, the proposed method achieves a balanced performance on the training dataset, with a sensitivity of 0.753 and a specificity of 0.725, which indicates that the method handles the imbalanced data problem effectively. To evaluate prediction performance objectively, the method is also assessed on an independent testing dataset. Encouragingly, it performs better than the previous study, with a sensitivity of 0.738 and a Youden’s Index of 0.451.

Conclusions: These results suggest that the proposed method is a potential candidate for aptamer-protein interacting pair prediction, and that it may contribute to finding novel aptamer-protein interacting pairs and to understanding the relationship between aptamers and proteins.
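The abstract reports sensitivity, specificity, and Youden’s Index without restating how they relate. As a reading aid, the sketch below uses the standard definition, Youden’s Index J = sensitivity + specificity − 1, to work out the two figures the abstract leaves implicit. The function names are hypothetical, and the derived values (training-set J and test-set specificity) are simple arithmetic on the reported numbers, not results quoted from the paper.

```python
def youden_index(sensitivity: float, specificity: float) -> float:
    """Standard definition: J = sensitivity + specificity - 1."""
    return sensitivity + specificity - 1.0


def specificity_from_youden(sensitivity: float, j: float) -> float:
    """Rearranged definition: specificity = J - sensitivity + 1."""
    return j - sensitivity + 1.0


# Training dataset, optimal feature set: sensitivity and specificity
# are reported in the abstract; J is derived here, not quoted.
train_j = youden_index(0.753, 0.725)

# Independent testing dataset: sensitivity and J are reported in the
# abstract; specificity is derived here, not quoted.
test_specificity = specificity_from_youden(0.738, 0.451)

print(f"implied training Youden's Index: {train_j:.3f}")
print(f"implied test specificity: {test_specificity:.3f}")
```

Under this definition, the training figures imply J ≈ 0.478, and the test figures imply a specificity of about 0.713. A J of 0 corresponds to a classifier no better than chance and 1 to perfect separation, which is why the index is a convenient single score for imbalanced data.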
【 License 】
CC BY
© Zhang et al. 2016