期刊论文详细信息
PeerJ
Neighborhood-based inference and restricted Boltzmann machine for microbe and drug associations prediction
article
Xiaolong Cheng1  Jia Qu1  Shuangbao Song1  Zekang Bian2 
[1] School of Computer Science and Artificial Intelligence, Changzhou University;School of AI & Computer Science, Jiangnan University
关键词: Drug;    Microbe;    Association prediction;    Neighborhood-based inference;    Restricted Boltzmann machine;    Ensemble learning;   
DOI  :  10.7717/peerj.13848
学科分类:社会科学、人文和艺术(综合)
来源: Inra
PDF
【 摘 要 】

BackgroundEfficient identification of microbe-drug associations is critical for drug development and solving problem of antimicrobial resistance. Traditional wet-lab method requires a lot of money and labor in identifying potential microbe-drug associations. With development of machine learning and publication of large amounts of biological data, computational methods become feasible.MethodsIn this article, we proposed a computational model of neighborhood-based inference (NI) and restricted Boltzmann machine (RBM) to predict potential microbe-drug association (NIRBMMDA) by using integrated microbe similarity, integrated drug similarity and known microbe-drug associations. First, NI was used to obtain a score matrix of potential microbe-drug associations by using different thresholds to find similar neighbors for drug or microbe. Second, RBM was employed to obtain another score matrix of potential microbe-drug associations based on contrastive divergence algorithm and sigmoid function. Because generalization ability of individual method is poor, we used an ensemble learning to integrate two score matrices for predicting potential microbe-drug associations more accurately. In particular, NI can fully utilize similar (neighbor) information of drug or microbe and RBM can learn potential probability distribution hid in known microbe-drug associations. Moreover, ensemble learning was used to integrate individual predictor for obtaining a stronger predictor.ResultsIn global leave-one-out cross validation (LOOCV), NIRBMMDA gained the area under the receiver operating characteristics curve (AUC) of 0.8666, 0.9413 and 0.9557 for datasets of DrugVirus, MDAD and aBiofilm, respectively. In local LOOCV, AUCs of 0.8512, 0.9204 and 0.9414 were obtained for NIRBMMDA based on datasets of DrugVirus, MDAD and aBiofilm, respectively. For five-fold cross validation, NIRBMMDA acquired AUC and standard deviation of 0.8569 ± −0.0027, 0.9248 ± −0.0014 and 0.9369 ± −0.0020 on the basis of datasets of DrugVirus, MDAD and aBiofilm, respectively. Moreover, case study for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) showed that 13 out of the top 20 predicted drugs were verified by searching literature. The other two case studies indicated that 17 and 17 out of the top 20 predicted microbes for the drug of ciprofloxacin and minocycline were confirmed by identifying published literature, respectively.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202307100003587ZK.pdf 3060KB PDF download
  文献评价指标  
  下载次数:5次 浏览次数:0次