期刊论文详细信息
BMC Genetics
Relative performance of gene- and pathway-level methods as secondary analyses for genome-wide association studies
Priya Duggal2  WH Linda Kao2  Genevieve L Wojcik1 
[1] Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA;Department of Epidemiology, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD, USA
关键词: Biological Pathways;    Gene Set;    Genome-wide Association Studies;   
Others  :  1164857
DOI  :  10.1186/s12863-015-0191-2
 received in 2014-10-28, accepted in 2015-03-19,  发布年份 2015
PDF
【 摘 要 】

Background

Despite the success of genome-wide association studies (GWAS), there still remains “missing heritability” for many traits. One contributing factor may be the result of examining one marker at a time as opposed to a group of markers that are biologically meaningful in aggregate. To address this problem, a variety of gene- and pathway-level methods have been developed to identify putative biologically relevant associations. A simulation was conducted to systematically assess the performance of these methods. Using genetic data from 4,500 individuals in the Wellcome Trust Case Control Consortium (WTCCC), case–control status was simulated based on an additive polygenic model. We evaluated gene-level methods based on their sensitivity, specificity, and proportion of false positives. Pathway-level methods were evaluated on the relationship between proportion of causal genes within the pathway and the strength of association.

Results

The gene-level methods had low sensitivity (20-63%), high specificity (89-100%), and low proportion of false positives (0.1-6%). The gene-level program VEGAS using only the top 10% of associated single nucleotide polymorphisms (SNPs) within the gene had the highest sensitivity (28.6%) with less than 1% false positives. The performance of the pathway-level methods depended on their reliance upon asymptotic distributions or if significance was estimated in a competitive manner. The pathway-level programs GenGen, GSA-SNP and MAGENTA had the best performance while accounting for potential confounders.

Conclusions

Novel genes and pathways can be identified using the gene and pathway-level methods. These methods may provide valuable insight into the “missing heritability” of traits and provide biological interpretations to GWAS findings.

【 授权许可】

   
2015 Wojcik et al.; licensee BioMed Central.

【 预 览 】
附件列表
Files Size Format View
20150416015418524.pdf 505KB PDF download
Figure 1. 28KB Image download
【 图 表 】

Figure 1.

【 参考文献 】
  • [1]Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, et al.: Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. PNAS 2009, 106:9362-7.
  • [2]Vineis P, Pearce N: Missing heritability in genome-wide association study research. Nat Rev Genet 2010, 11:1.
  • [3]McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, Ioannidis JPA, et al.: Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet 2008, 9:356-69.
  • [4]Fridley BL, Biernacka JM: Gene set analysis of SNP data: benefits, challenges, and future directions. Eur J Hum Genet 2011, 19:837-43.
  • [5]la Cruz DO, Wen X, Ke B, Song M, Nicolae DL: Gene, region and pathway level analyses in whole-genome studies. Genet Epidemiol 2010, 34:222-31.
  • [6]Stranger BE, Stahl EA, Raj T: Progress and promise of genome-wide association studies for human complex trait genetics. Genetics 2011, 187:367-83.
  • [7]Biernacka JM, Jenkins GD, Wang L, Moyer AM, Fridley BL: Use of the gamma method for self-contained gene-set analysis of SNP data. European Journal of Human Genetics 2011, 20:565-571.
  • [8]Gauderman WJ, Murcray C, Gilliland F, Conti DV: Testing association between disease and multiple SNPs in a candidate gene. Genet Epidemiol 2007, 31:383-95.
  • [9]Burton PR, Clayton DG, Cardon LR, Craddock N, Deloukas P, Duncanson A, et al.: Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 2007, 447:661-78.
  • [10]Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D: Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 2006, 38:904-9.
  • [11]Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al.: PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am J Hum Genet 2007, 81:559-75.
  • [12]Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al.: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci 2005, 102:15545-50.
  • [13]de Bakker PIW, Yelensky R, Pe’er I, Gabriel SB, Daly MJ, Altshuler D: Efficiency and power in genetic association studies. Nat Genet 2005, 37:1217-23.
  • [14]Veyrieras J-B, Kudaravalli S, Kim SY, Dermitzakis ET, Gilad Y, Stephens M, et al.: High-resolution mapping of expression-QTLs yields insight into human gene regulation. PLoS Genet 2008, 4:e1000214.
  • [15]Peng G, Luo L, Siu H, Zhu Y, Hu P, Hong S, et al.: Gene and pathway-based second-wave analysis of genome-wide association studies. Eur J Hum Genet 2010, 18:111-7.
  • [16]Zaykin DV, Zhivotovsky LA, Westfall PH, Weir BS: Truncated product method for combining P-values. Genet Epidemiol 2002, 22:170-85.
  • [17]Li M-X, Gui H-S, Kwan JSH, Sham PC: GATES: a rapid and powerful gene-based association test using extended Simes procedure. Am J Hum Genet 2011, 88:283-93.
  • [18]Li MX, Kwan J, Sham PC: HYST: a hybrid set-based test for genome-wide association studies, with application to protein-protein interaction-based association analysis. Am J Hum Gen 2012, 7;91(3):478-88. doi:10.1016/j.ajhg.2012.08.004
  • [19]Liu JZ, Mcrae AF, Nyholt DR, Medland SE, Wray NR, Brown KM, et al.: A versatile gene-based test for genome-wide association studies. Am J Hum Genet 2010, 87:139-45.
  • [20]Segrè AV, Groop L, Mootha VK, Daly MJ, Altshuler D: Common Inherited Variation in Mitochondrial Genes Is Not Enriched for Associations with Type 2 Diabetes or Related Glycemic Traits. PLoS Genet 2010, 6(8):e1001058. doi: 10.1371/journal.pgen.1001058
  • [21]Nam D, Kim J, Kim SY, Kim S: GSA-SNP: a general approach for gene set analysis of polymorphisms. Nucleic Acids Res 2010, 38(Web Server):W749-54.
  • [22]Holden M, Deng S, Wojnowski L, Kulle B: GSEA-SNP: applying gene set enrichment analysis to SNP data from genome-wide association studies. Bioinformatics 2008, 24:2784-5.
  • [23]Chen LS, Hutter CM, Potter JD, Liu Y, Prentice RL, Peters U, et al.: AR TICLEInsights into Colon Cancer Etiology via a Regularized Approachto Gene Set Analysis of GWAS Data. Am J Hum Genet 2010, 86:860-71.
  • [24]Holmans P, Green EK, Pahwa JS, Ferreira MAR, Purcell SM, Sklar P, et al.: AR TICLEGene Ontology Analysis of GWA Study Data Sets Provides Insights into the Biology of Bipolar Disorder. Am J Hum Genet 2009, 85:13-24.
  • [25]Wang K, Li M, Bućan M: Pathway-based approaches for analysis of genomewide association studies. Am J Hum Genet 2007, 81:1278-83.
  • [26]Dai H: A modified generalized Fisher method for combining probabilities from dependent tests. Frontiers in Genetics 2014, 5:1-10. Article 32
  • [27]O’Dushlaine C, Kenny E, Heron EA, Segurado R, Gill M, Morris DW, et al.: The SNP ratio test: pathway analysis of genome-wide association datasets. Bioinformatics 2009, 25:2762-3.
  文献评价指标  
  下载次数:11次 浏览次数:22次