Frontiers in Genetics | |
Predicting the Disease Genes of Multiple Sclerosis Based on Network Representation Learning | |
He Li1  Hansheng Xue5  Jiaojiao Guan5  Zhijie Bao6  Xun Luo7  Qingmei Wang8  Haijie Liu8  | |
[1] Department of Automation, College of Information Science and Engineering, Tianjin Tianshi College, Tianjin, China;Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing, China;Department of Physical Medicine and Rehabilitation, Tianjin Medical University General Hospital, Tianjin, China;Kerry Rehabilitation Medicine Research Institute, Shenzhen, China;School of Computer Science, Northwestern Polytechnical University, Xi'an, China;School of Textile Science and Engineering, Tiangong University, Tianjin, China;Shenzhen Dapeng New District Nan'ao People's Hospital, Shenzhen, China;Stroke Biological Recovery Laboratory, Department of Physical Medicine and Rehabilitation, Spaulding Rehabilitation Hospital, The Teaching Affiliate of Harvard Medical School Charlestown, Boston, MA, United States; | |
关键词: multiple sclerosis; network embedding; disease gene prediction; PPI network; deep learning; | |
DOI : 10.3389/fgene.2020.00328 | |
来源: DOAJ |
【 摘 要 】
Multiple sclerosis (MS) is an autoimmune disease for which it is difficult to find exact disease-related genes. Effectively identifying disease-related genes would contribute to improving the treatment and diagnosis of multiple sclerosis. Current methods for identifying disease-related genes mainly focus on the hypothesis of guilt-by-association and pay little attention to the global topological information of the whole protein-protein-interaction (PPI) network. Besides, network representation learning (NRL) has attracted a huge amount of attention in the area of network analysis because of its promising performance in node representation and many downstream tasks. In this paper, we try to introduce NRL into the task of disease-related gene prediction and propose a novel framework for identifying the disease-related genes multiple sclerosis. The proposed framework contains three main steps: capturing the topological structure of the PPI network using NRL-based methods, encoding learned features into low-dimensional space using a stacked autoencoder, and training a support vector machine (SVM) classifier to predict disease-related genes. Compared with three state-of-the-art algorithms, our proposed framework shows superior performance on the task of predicting disease-related genes of multiple sclerosis.
【 授权许可】
Unknown