BMC Evolutionary Biology | |
Modeling the evolution dynamics of exon-intron structure with a general random fragmentation process | |
Lincoln D Stein1  Liya Wang2  | |
[1] Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5S 1A8, Canada;Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA | |
关键词: Simulation; General random fragmentation process; Evolution of exon-intron structure; | |
Others : 1129592 DOI : 10.1186/1471-2148-13-57 |
|
received in 2012-11-20, accepted in 2013-02-22, 发布年份 2013 | |
【 摘 要 】
Background
Most eukaryotic genes are interrupted by spliceosomal introns. The evolution of exon-intron structure remains mysterious despite rapid advance in genome sequencing technique. In this work, a novel approach is taken based on the assumptions that the evolution of exon-intron structure is a stochastic process, and that the characteristics of this process can be understood by examining its historical outcome, the present-day size distribution of internal translated exons (exon). Through the combination of simulation and modeling the size distribution of exons in different species, we propose a general random fragmentation process (GRFP) to characterize the evolution dynamics of exon-intron structure. This model accurately predicts the probability that an exon will be split by a new intron and the distribution of novel insertions along the length of the exon.
Results
As the first observation from this model, we show that the chance for an exon to obtain an intron is proportional to its size to the 3rd power. We also show that such size dependence is nearly constant across gene, with the exception of the exons adjacent to the 5′ UTR. As the second conclusion from the model, we show that intron insertion loci follow a normal distribution with a mean of 0.5 (center of the exon) and a standard deviation of 0.11. Finally, we show that intron insertions within a gene are independent of each other for vertebrates, but are more negatively correlated for non-vertebrate. We use simulation to demonstrate that the negative correlation might result from significant intron loss during evolution, which could be explained by selection against multi-intron genes in these organisms.
Conclusions
The GRFP model suggests that intron gain is dynamic with a higher chance for longer exons; introns are inserted into exons randomly with the highest probability at the center of the exon. GRFP estimates that there are 78 introns in every 10 kb coding sequences for vertebrate genomes, agreeing with empirical observations. GRFP also estimates that there are significant intron losses in the evolution of non-vertebrate genomes, with extreme cases of around 57% intron loss in Drosophila melanogaster, 28% in Caenorhabditis elegans, and 24% in Oryza sativa.
【 授权许可】
2013 Wang and Stein; licensee BioMed Central Ltd.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
20150226080625918.pdf | 1958KB | download | |
Figure 7. | 49KB | Image | download |
Figure 6. | 64KB | Image | download |
Figure 5. | 206KB | Image | download |
Figure 4. | 108KB | Image | download |
Figure 3. | 107KB | Image | download |
Figure 2. | 90KB | Image | download |
Figure 1. | 41KB | Image | download |
【 图 表 】
Figure 1.
Figure 2.
Figure 3.
Figure 4.
Figure 5.
Figure 6.
Figure 7.
【 参考文献 】
- [1]Gilbert W: The exon theory of genes. Cold Spring Harb Symp Quant Biol 1987, 52:901-905.
- [2]Gilbert W, Glynias M: On the ancient nature of introns. Gene 1993, 135(1–2):137-144.
- [3]Penny D, Hoeppner MP, Poole AM, Jeffares DC: An overview of the introns-first theory. J Mol Evol 2009, 69(5):527-540.
- [4]Stoltzfus A, Spencer DF, Zuker M, Logsdon JM Jr, Doolittle WF: Testing the exon theory of genes: the evidence from protein structure. Science 1994, 265(5169):202-207.
- [5]Logsdon JM Jr, Tyshenko MG, Dixon C, DJ J, Walker VK, Palmer JD: Seven newly discovered intron positions in the triose-phosphate isomerase gene: evidence for the introns-late theory. Proc Natl Acad Sci USA 1995, 92(18):8507-8511.
- [6]Rogozin IB, Carmel L, Csuros M, Koonin EV: Origin and evolution of spliceosomal introns. Biol Direct 2012, 7:11. BioMed Central Full Text
- [7]Csuros M, Rogozin IB, Koonin EV: A detailed history of intron-rich eukaryotic ancestors inferred from a global survey of 100 complete genomes. PLoS Comput Biol 2011, 7(9):e1002150.
- [8]Zhang MQ: Statistical features of human exons and their flanking regions. Hum Mol Genet 1998, 7(5):919-932.
- [9]Gudlaugsdottir S, Boswell DR, Wood GR, Ma J: Exon size distribution and the origin of introns. Genetica 2007, 131(3):299-306.
- [10]Weibull GW: Citation Classic - a Statistical Distribution Function of Wide Applicability. Curr Cont/Eng Technol Appl Sci 1981, 10:18-18.
- [11]Ryabov Y, Gribskov M: Spontaneous symmetry breaking in genome evolution. Nucleic Acids Res 2008, 36(8):2756-2763.
- [12]Kolmogoroff AN: Concerning the logarithmic normal distribution principle of dimensions of particles during dispersal. Cr Acad Sci Urss 1941, 31:99-101.
- [13]Tenchov BG, Yanev TK: Weibull Distribution of Particle Sizes Obtained by Uniform Random Fragmentation. J Colloid Interface Sci 1986, 111(1):1-7.
- [14]Cho G, Doolittle RF: Intron distribution in ancient paralogs supports random insertion and not random loss. J Mol Evol 1997, 44(6):573-584.
- [15]Rogozin IB, Sverdlov AV, Babenko VN, Koonin EV: Analysis of evolution of exon-intron structure of eukaryotic genes. Brief Bioinform 2005, 6(2):118-134.
- [16]Logsdon JM Jr, Palmer JD: Origin of introns--early or late. Nature 1994, 369(6481):526. author reply 527–528
- [17]Matsumoto M, Nishimura T: Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator. ACM Trans Model Comput Simul 1998, 8(1):3-30.
- [18]Coleman TF, Li YY: An interior trust region approach for nonlinear minimization subject to bounds. Siam J Optim 1996, 6(2):418-445.
- [19]Thomas FC, Li YY: On the Convergence of Interior-Reflective Newton Methods for Nonlinear Minimization Subject to Bounds. Math Program 1994, 67(2):189-224.
- [20]Flicek P, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D, Clapham P, Coates G, Fairley S, Fitzgerald S: Ensembl 2012. Nucleic Acids Res 2012, 40(Database issue):D84-90.
- [21]Chen JQ, Wu Y, Yang H, Bergelson J, Kreitman M, Tian D: Variation in the ratio of nucleotide substitution and indel rates across genomes in mammals and bacteria. Mol Biol Evol 2009, 26(7):1523-1531.
- [22]Letunic I, Copley RR, Bork P: Common exon duplication in animals and its role in alternative splicing. Hum Mol Genet 2002, 11(13):1561-1567.
- [23]Long MY, De Souza SJ, Rosenberg C, Gilbert WE: Relationship between "proto-splice sites" and intron phases: Evidence from dicodon analysis. Proc Natl Acad Sci USA 1998, 95(1):219-223.
- [24]Long MY, Rosenberg C: Testing the "proto-splice sites" model of intron origin: Evidence from analysis of intron phase correlations. Mol Biol Evol 2000, 17(12):1789-1796.
- [25]Berget SM: Exon recognition in vertebrate splicing. J Biol Chem 1995, 270(6):2411-2414.
- [26]Cho S, Jin SW, Cohen A, Ellis RE: A phylogeny of caenorhabditis reveals frequent loss of introns during nematode evolution. Genome Res 2004, 14(7):1207-1220.
- [27]Kiontke K, Gavin NP, Raynes Y, Roehrig C, Piano F, Fitch DH: Caenorhabditis phylogeny predicts convergence of hermaphroditism and extensive intron loss. Proc Natl Acad Sci U S A 2004, 101(24):9003-9008.
- [28]Coulombe-Huntington J, Majewski J: Intron loss and gain in Drosophila. Mol Biol Evol 2007, 24(12):2842-2850.
- [29]Jeffares DC, Mourier T, Penny D: The biology of intron gain and loss. Trends Genet 2006, 22(1):16-22.
- [30]Xu G, Guo C, Shan H, Kong H: Divergence of duplicate genes in exon-intron structure. Proc Natl Acad Sci USA 2012, 109(4):1187-1192.
- [31]Yenerall P, Zhou L: Identifying the mechanisms of intron gain: progress and trends. Biol Direct 2012, 7:29. BioMed Central Full Text
- [32]Koonin EV, Csuros M, Rogozin IB: Whence genes in pieces: reconstruction of the exon-intron gene structures of the last eukaryotic common ancestor and other ancestral eukaryotes. Wiley Interdiscip Rev RNA 2013, 4(1):93-105.
- [33]Roy SW, Gilbert W: The evolution of spliceosomal introns: patterns, puzzles and progress. Nat Rev Genet 2006, 7(3):211-221.
- [34]Rodriguez-Trelles F, Tarrio R, Ayala FJ: Origins and evolution of spliceosomal introns. Annu Rev Genet 2006, 40:47-76.
- [35]Koonin EV: The origin of introns and their role in eukaryogenesis: a compromise solution to the introns-early versus introns-late debate. Biol Direct 2006, 1:1-22. BioMed Central Full Text
- [36]Whitney KD, Garland T: Did Genetic Drift Drive Increases in Genome Complexity. PLoS Genet 2010, 6(8):1-6.