期刊论文详细信息
BMC Genetics
MCC-SP: a powerful integration method for identification of causal pathways from genetic variants to complex disease
Lu Liu1  Weiqiang Lin1  Yuchen Zhu1  Mingzhuo Li1  Xiujun Li1  Fuzhong Xue1  Zhongshang Yuan1  Huanhuan Zhu2  Xiang Zhou2  Jiadong Ji3 
[1] Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University;Department of Biostatistics, University of Michigan;Department of Data Science, School of Statistics, Shandong University of Finance and Economics;
关键词: Maximum correlation coefficient;    K shortest paths algorithms;    Integration method;    Pathway;    Alzheimer’s disease;   
DOI  :  10.1186/s12863-020-00899-3
来源: DOAJ
【 摘 要 】

Abstract Background Genome-wide association studies (GWAS) have successfully identified genetic susceptible variants for complex diseases. However, the underlying mechanism of such association remains largely unknown. Most disease-associated genetic variants have been shown to reside in noncoding regions, leading to the hypothesis that regulation of gene expression may be the primary biological mechanism. Current methods to characterize gene expression mediating the effect of genetic variant on diseases, often analyzed one gene at a time and ignored the network structure. The impact of genetic variant can propagate to other genes along the links in the network, then to the final disease. There could be multiple pathways from the genetic variant to the final disease, with each having the chain structure since the first node is one specific SNP (Single Nucleotide Polymorphism) variant and the end is disease outcome. One key but inadequately addressed question is how to measure the between-node connection strength and rank the effects of such chain-type pathways, which can provide statistical evidence to give the priority of some pathways for potential drug development in a cost-effective manner. Results We first introduce the maximal correlation coefficient (MCC) to represent the between-node connection, and then integrate MCC with K shortest paths algorithm to rank and identify the potential pathways from genetic variant to disease. The pathway importance score (PIS) was further provided to quantify the importance of each pathway. We termed this method as “MCC-SP”. Various simulations are conducted to illustrate MCC is a better measurement of the between-node connection strength than other quantities including Pearson correlation, Spearman correlation, distance correlation, mutual information, and maximal information coefficient. Finally, we applied MCC-SP to analyze one real dataset from the Religious Orders Study and the Memory and Aging Project, and successfully detected 2 typical pathways from APOE genotype to Alzheimer’s disease (AD) through gene expression enriched in Alzheimer’s disease pathway. Conclusions MCC-SP has powerful and robust performance in identifying the pathway(s) from the genetic variant to the disease. The source code of MCC-SP is freely available at GitHub ( https://github.com/zhuyuchen95/ADnet ).

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:17次