期刊论文详细信息
BMC Bioinformatics
Age-adjusted nonparametric detection of differential DNA methylation with case–control designs
Hanwen Huang1  Zhongxue Chen2  Xudong Huang3 
[1] Department of Epidemiology and Biostatistics, University of Georgia, Athens, GA 30605, USA
[2] Department of Epidemiology and Biostatistics, School of Public Health, Indiana University Bloomington, 1025 E. 7th Street, Bloomington, IN 47405, USA
[3] Neurochemistry Laboratory, Department of Psychiatry, Massachusetts General Hospital and Harvard Medical School, Charlestown, MA 02129, USA
关键词: Combining p-value;    One-sided test;    Nonparametric method;   
Others  :  1087949
DOI  :  10.1186/1471-2105-14-86
 received in 2012-08-13, accepted in 2013-02-20,  发布年份 2013
PDF
【 摘 要 】

Background

DNA methylation profiles differ among disease types and, therefore, can be used in disease diagnosis. In addition, large-scale whole genome DNA methylation data offer tremendous potential in understanding the role of DNA methylation in normal development and function. However, due to the unique feature of the methylation data, powerful and robust statistical methods are very limited in this area.

Results

In this paper, we proposed and examined a new statistical method to detect differentially methylated loci for case control designs that is fully nonparametric and does not depend on any assumption for the underlying distribution of the data. Moreover, the proposed method adjusts for the age effect that has been shown to be highly correlated with DNA methylation profiles. Using simulation studies and a real data application, we have demonstrated the advantages of our method over existing commonly used methods.

Conclusions

Compared to existing methods, our method improved the detection power for differentially methylated loci for case control designs and controlled the type I error well. Its applications are not limited to methylation data; it can be extended to many other case–control studies.

【 授权许可】

   
2013 Huang et al; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20150117061145585.pdf 614KB PDF download
Figure 2. 60KB Image download
Figure 1. 57KB Image download
【 图 表 】

Figure 1.

Figure 2.

【 参考文献 】
  • [1]Baylin SB, Ohm JE: Epigenetic gene silencing in cancer–a mechanism for early oncogenic pathway addiction? Nat Rev Cancer 2006, 6(2):107-116.
  • [2]Kuan PF, Wang S, Zhou X, Chu H: A statistical framework for Illumina DNA methylation arrays. Bioinformatics 2010, 26(22):2849.
  • [3]Rakyan VK, Down TA, Thorne NP, Flicek P, Kulesha E, Gräf S, Tomazou EM, Bäckdahl L, Johnson N, Herberth M: An integrated resource for genome-wide identification and analysis of human tissue-specific differentially methylated regions (tDMRs). Genome Res 2008, 18(9):1518-1529.
  • [4]Kuan PF, Chiang DY: Integrating prior knowledge in multiple testing under dependence with applications to detecting differential DNA methylation. Biometrics 2012.
  • [5]Feinberg AP, Tycko B: The history of cancer epigenetics. Nat Rev Cancer 2004, 4(2):143-153.
  • [6]Jabbari K, Bernardi G: Cytosine methylation and CpG, TpG (CpA) and TpA frequencies. Gene 2004, 333:143-149.
  • [7]Jones PA, Baylin SB: The fundamental role of epigenetic events in cancer. Nat Rev Genet 2002, 3(6):415-428.
  • [8]Kulis M, Esteller M: DNA methylation and cancer. Adv Genet 2010, 70:27-56.
  • [9]Laird PW: Principles and challenges of genome-wide DNA methylation analysis. Nat Rev Genet 2010, 11(3):191-203.
  • [10]Xu GL, Bestor TH, Bourc'his D, Hsieh CL, Tommerup N, Bugge M, Hulten M, Qu X, Russo JJ, Viegas-Péquignot E: Chromosome instability and immunodeficiency syndrome caused by mutations in a DNA methyltransferase gene. Nature 1999, 402(6758):187-191.
  • [11]Siegmund KD, Laird PW, Laird-Offringa IA: A comparison of cluster analysis methods using DNA methylation data. Bioinformatics 2004, 20(12):1896-1904.
  • [12]Houseman EA, Christensen BC, Yeh RF, Marsit CJ, Karagas MR, Wrensch M, Nelson HH, Wiemels J, Zheng S, Wiencke JK: Model-based clustering of DNA methylation array data: a recursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distributions. BMC Bioinformatics 2008, 9(1):365. BioMed Central Full Text
  • [13]Wang S: Method to detect differentially methylated loci with case–control designs using Illumina arrays. Genet Epidemiol 2011, 35(7):686-694.
  • [14]Christensen BC, Houseman EA, Marsit CJ, Zheng S, Wrensch MR, Wiemels JL, Nelson HH, Karagas MR, Padbury JF, Bueno R: Aging and environmental exposures alter tissue-specific DNA methylation dependent upon CpG island context. PLoS Genet 2009, 5(8):e1000602.
  • [15]Teschendorff AE, Menon U, Gentry-Maharaj A, Ramus SJ, Weisenberger DJ, Shen H, Campan M, Noushmehr H, Bell CG, Maxwell AP: Age-dependent DNA methylation of genes that are suppressed in stem cells is a hallmark of cancer. Genome Res 2010, 20(4):440-446.
  • [16]Chen Z, Liu Q, Nadarajah S: A new statistical approach to detecting differentially methylated loci for case control Illumina array methylation data. Bioinformatics 2012, 28(8):1109-1113.
  • [17]Chen Z, Huang H, Liu J, Ng HKT, Nadarajah S, Huang X, Deng Y: Detecting differentially methylated loci for Illumina Array methylation data based on human ovarian cancer data. BMC Med Genomics 2013, 6(Suppl 1):S9.
  • [18]Baumgartner W, Weiß P, Schindler H: A nonparametric test for the general two-sample problem. Biometrics 1998, 54:1129-1135.
  • [19]Neuhäuser M: Exact tests for the analysis of case–control studies of genetic markers. Hum Hered 2002, 54(3):151-156.
  • [20]Fisher RA: Statistical Methods for Research Workers. Edinburgh: Oliver and Boyd; 1932.
  • [21]Owen AB: Karl Pearson's meta-analysis revisited. Ann Statist 2009, 37(6B):3867-3892.
  • [22]Chen Z, Ng HKT: A robust method for testing association in genome-wide association studies. Hum Hered 2012, 73(1):26-34.
  • [23]Chen Z: A new association test based on Chi‐square partition for case‐control GWA studies. Genet Epidemiol 2011, 35(7):658-663.
  • [24]Chen Z, Huang H, Ng HKT: Testing for association in case–control genome-wide association studies with shared controls. Stat Methods Med Res 2013. First published on February 1, 2013
  • [25]Chen Z, Huang H, Ng HKT: Design and analysis of multiple diseases genome-wide association studies without controls. Gene 2012, 510(1):87-92.
  • [26]Chen Z, Liu J, Ng HKT, Nadarajah S, Kaufman HL, Yang JY, Deng Y: Statistical methods on detecting differentially expressed genes for RNA-seq data. BMC Syst Biol 2011, 5(Suppl 3):S1. BioMed Central Full Text
  • [27]Chen Z, McGee M, Liu Q, Kong YM, Huang X, Yang JY, Scheuermann RH: Identifying differentially expressed genes based on probe level data for GeneChip arrays. Int J Comput Biol Drug Des 2010, 3(3):237-257.
  • [28]Chen Z, Liu Q, McGee M, Kong M, Huang X, Deng Y, Scheuermann RH: A gene selection method for GeneChip array data with small sample sizes. BMC Genomics 2011, 12(Suppl 5):S7. BioMed Central Full Text
  • [29]Chen Z, Liu Q: A New approach to account for the correlations among single nucleotide polymorphisms in genome-wide association studies. Hum Hered 2011, 72(1):1-9.
  • [30]Chen Z: Is the weighted z‐test the best method for combining probabilities from independent tests? J Evol Biol 2011, 24(4):926-930.
  • [31]Chen Z, Nadarajah S: Comments on ‘Choosing an optimal method to combine p‐values’ by Sungho Won, Nathan Morris, Qing Lu and Robert C. Elston, Statistics in Medicine 2009; 28: 1537–1553. Stat Med 2011, 30(24):2959-2961.
  • [32]Lancaster H: The combination of probabilities: an application of orthonormal functions. Austral J Statist 1961, 3:20-33.
  文献评价指标  
  下载次数:17次 浏览次数:9次