期刊论文详细信息
BMC Bioinformatics
A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
Han Zhang1  Zhuangwei Shi1  Xiongwen Quan1  Chen Jin2  Yanbin Yin3 
[1] College of Artificial Intelligence, Nankai University, Tongyan Road, 300350, Tianjin, China;College of Computer Science, Nankai University, Tongyan Road, 300350, Tianjin, China;Department of Food Science and Technology, Nebraska Food for Health Center, University of Nebraska-Lincoln, 1400 R Street, 68588, Lincoln, NE, USA;
关键词: Variational inference;    Graph autoencoder;    lncRNA-disease association;    Representation learning;   
DOI  :  10.1186/s12859-021-04073-z
来源: Springer
PDF
【 摘 要 】

BackgroundNumerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been adopted to this problem, yet it is still challenging to learn efficient low-dimensional representations from high-dimensional features of lncRNAs and diseases to predict unknown lncRNA-disease associations accurately.ResultsWe proposed an end-to-end model, VGAELDA, which integrates variational inference and graph autoencoders for lncRNA-disease associations prediction. VGAELDA contains two kinds of graph autoencoders. Variational graph autoencoders (VGAE) infer representations from features of lncRNAs and diseases respectively, while graph autoencoders propagate labels via known lncRNA-disease associations. These two kinds of autoencoders are trained alternately by adopting variational expectation maximization algorithm. The integration of both the VGAE for graph representation learning, and the alternate training via variational inference, strengthens the capability of VGAELDA to capture efficient low-dimensional representations from high-dimensional features, and hence promotes the robustness and preciseness for predicting unknown lncRNA-disease associations. Further analysis illuminates that the designed co-training framework of lncRNA and disease for VGAELDA solves a geometric matrix completion problem for capturing efficient low-dimensional representations via a deep learning approach.ConclusionCross validations and numerical experiments illustrate that VGAELDA outperforms the current state-of-the-art methods in lncRNA-disease association prediction. Case studies indicate that VGAELDA is capable of detecting potential lncRNA-disease associations. The source code and data are available at https://github.com/zhanglabNKU/VGAELDA.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202107029104712ZK.pdf 2702KB PDF download
  文献评价指标  
  下载次数:6次 浏览次数:6次