期刊论文详细信息
Molecules
EPuL: An Enhanced Positive-Unlabeled Learning Algorithm for the Prediction of Pupylation Sites
Gai-Ge Wang1  Arun Kumar Sangaiah2  Xuanguo Nan3  Xiaosa Zhao3  Lingling Bao3  Xiaowei Zhao3  Zhiqiang Ma3 
[1] School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China;School of Computing Science and Engineering, VIT University, Vellore 632014, Tamil Nadu, India;School of Information Science and Technology, Northeast Normal University, Changchun 130117, China;
关键词: positive-unlabeled learning algorithm;    pupylation sites;    prediction;    web server;    support vector machine;   
DOI  :  10.3390/molecules22091463
来源: DOAJ
【 摘 要 】

Protein pupylation is a type of post-translation modification, which plays a crucial role in cellular function of bacterial organisms in prokaryotes. To have a better insight of the mechanisms underlying pupylation an initial, but important, step is to identify pupylation sites. To date, several computational methods have been established for the prediction of pupylation sites which usually artificially design the negative samples using the verified pupylation proteins to train the classifiers. However, if this process is not properly done it can affect the performance of the final predictor dramatically. In this work, different from previous computational methods, we proposed an enhanced positive-unlabeled learning algorithm (EPuL) to the pupylation site prediction problem, which uses only positive and unlabeled samples. Firstly, we separate the training dataset into the positive dataset and the unlabeled dataset which contains the remaining non-annotated lysine residues. Then, the EPuL algorithm is utilized to select the reliably negative initial dataset and then iteratively pick out the non-pupylation sites. The performance of the proposed method was measured with an accuracy of 90.24%, an Area Under Curve (AUC) of 0.93 and an MCC of 0.81 by 10-fold cross-validation. A user-friendly web server for predicting pupylation sites was developed and was freely available at http://59.73.198.144:8080/EPuL

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次