BMC Bioinformatics | |
Gene expression and nucleotide composition are associated with genic methylation level in Oryza sativa | |
Tatiana V Tatarinova2  Matteo Pellegrini1  Eran Elhaik3  | |
[1]Molecular, Cell, and Developmental Biology, University of California, 610 Charles Young Drive East, Los Angeles, CA 90095, USA | |
[2]Children’s Hospital Los Angeles, Keck School of Medicine, University of Southern California, 4650 Sunset Blvd, Los Angeles, CA 90027, USA | |
[3]Department of Animal and Plant Sciences, University of Sheffield, Western Bank, Sheffield S10 2TN, UK | |
关键词: Oryza sativa; Prediction; GC3; Gene expression; DNA methylation; | |
Others : 1087644 DOI : 10.1186/1471-2105-15-23 |
|
received in 2013-09-25, accepted in 2013-12-26, 发布年份 2014 | |
【 摘 要 】
Background
The methylation of cytosines at CpG dinucleotides, which plays an important role in gene expression regulation, is one of the most studied epigenetic modifications. Thus far, the detection of DNA methylation has been determined mostly by experimental methods, which are not only prone to bench effects and artifacts but are also time-consuming, expensive, and cannot be easily scaled up to many samples. It is therefore useful to develop computational prediction methods for DNA methylation. Our previous studies highlighted the existence of correlations between the GC content of the third codon position (GC3), methylation, and gene expression. We thus designed a model to predict methylation in Oryza sativa based on genomic sequence features and gene expression data.
Results
We first derive equations to describe the relationship between gene methylation levels, GC3, expression, length, and other gene compositional features. We next assess gene compositional features involving sixmers and their association with methylation levels and other gene level properties. By applying our sixmer-based approach on rice gene expression data we show that it can accurately predict methylation (Pearson’s correlation coefficient r = 0.79) for the majority (79%) of the genes. Matlab code with our model is included.
Conclusions
Gene expression variation can be used as predictors of gene methylation levels.
【 授权许可】
2014 Elhaik et al.; licensee BioMed Central Ltd.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
20150117024945857.pdf | 688KB | download | |
Figure 4. | 56KB | Image | download |
Figure 3. | 106KB | Image | download |
Figure 2. | 144KB | Image | download |
Figure 1. | 42KB | Image | download |
【 图 表 】
Figure 1.
Figure 2.
Figure 3.
Figure 4.
【 参考文献 】
- [1]Slack JM: Conrad Hal Waddington: the last Renaissance biologist? Nat Rev Genet 2002, 3:889-895.
- [2]Sadikovic B, Al-Romaih K, Squire JA, Zielenska M: Cause and consequences of genetic and epigenetic alterations in human cancer. Curr Genomics 2008, 9:394-408.
- [3]Li D, Da L, Tang H, Li T, Zhao M: CpG methylation plays a vital role in determining tissue- and cell-specific expression of the human cell-death-inducing DFF45-like effector A gene through the regulation of Sp1/Sp3 binding. Nucleic Acids Res 2008, 36:330-341.
- [4]Chodavarapu RK, Feng S, Ding B, Simon SA, Lopez D, Jia Y, et al.: Transcriptome and methylome interactions in rice hybrids. Proc Natl Acad Sci U S A 2012, 109:12040-12045.
- [5]Warnecke PM, Stirzaker C, Song J, Grunau C, Melki JR, Clark SJ: Identification and resolution of artifacts in bisulfite sequencing. Methods 2002, 27:101-107.
- [6]Warnecke PM, Stirzaker C, Melki JR, Millar DS, Paul CL, Clark SJ: Detection and measurement of PCR bias in quantitative methylation analysis of bisulphite-treated DNA. Nucleic Acids Res 1997, 25:4422-4426.
- [7]Darst RP, Pardo CE, Ai L, Brown KD, Kladde MP: Bisulfite sequencing of DNA. Curr Protoc Mol Biol 2010, 9:1-17. Chapter 7:Unit 7
- [8]Fang F, Fan S, Zhang X, Zhang MQ: Predicting methylation status of CpG islands in the human brain. Bioinformatics 2006, 22:2204-2209.
- [9]Bock C, Von Kuster G, Halachev K, Taylor J, Nekrutenko A, Lengauer T: Web-based analysis of (Epi-) genome data using EpiGRAPH and Galaxy. Methods Mol Biol 2010, 628:275-296.
- [10]Bock C, Paulsen M, Tierling S, Mikeska T, Lengauer T, Walter J: CpG island methylation in human lymphocytes is highly correlated with DNA sequence, repeats, and predicted DNA structure. PLoS Genet 2006, 2:e26.
- [11]Das R, Dimitrova N, Xuan Z, Rollins RA, Haghighi F, Edwards JR, et al.: Computational prediction of methylation status in human genomic sequences. Proc Natl Acad Sci U S A 2006, 103:10713-10716.
- [12]Bhasin M, Zhang H, Reinherz E, Reche P: Prediction of methylated CpGs in DNA sequences using a support vector machine. FEBS Lett 2005, 579:4302-4308.
- [13]Rakyan VK, Hildmann T, Novik K, Lewin J, Tost J, Cox A, et al.: DNA methylation profiling of the human major histocompatibility complex: a pilot study for the human epigenome project. PLoS Biol 2004, 2(12):e405.
- [14]Zhou X, Li Z, Dai Z, Zou X: Prediction of methylation CpGs and their methylation degrees in human DNA sequences. Comput Biol Med 2012, 42:408-413.
- [15]Phillips T: The role of methylation in gene expression. Nat Educ 2008., 1(1)
- [16]Sun Z, Asmann Y, Kalari K, Bot B, Eckel-Passow J, Baker T, et al.: Integrated analysis of gene expression, CpG island methylation, and gene copy number in breast cancer cells by deep sequencing. PLoS One 2011, 6:e17490.
- [17]Kawahara Y, de la Bastide M, Hamilton J, Kanamori H, McCombie W, Ouyang S, et al.: Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. Rice 2013, 6(1):1-10. BioMed Central Full Text
- [18]Tatarinova T, Elhaik E, Pellegrini M: Cross-species analysis of genic GC3 content and DNA methylation patterns. Genome Biol Evol 2013, 5:1443-1456.
- [19]Tatarinova T, Alexandrov N, Bouck J, Feldmann K: GC3 biology in corn, rice, sorghum and other grasses. BMC Genomics 2010, 11(1):308. BioMed Central Full Text
- [20]Elhaik E, Tatarinova T: GC3 Biology in Eukaryotes and Prokaryotes. In DNA Methylation - From Genomics to Technology Edited by In Tech, Tatarinova T, Kerton O. 2012, 55-68.
- [21]Snee RD: Validation of regression models: methods and examples. Technometrics 1977, 19:415-428.