学位论文详细信息
Site-to-site Rate Variation in Protein Coding Genes
codon model;synonymous and nonsynonymous substitution rates;site-to-site rate heterogeneity;molecular evolution;gamma distribution;rate correlation
Mannino, Frank Vincent ; Spencer Muse, Committee Chair,William Atchley, Committee Member,Jeffrey Thorne, Committee Member,Bruce Weir, Committee Member,Mannino, Frank Vincent ; Spencer Muse ; Committee Chair ; William Atchley ; Committee Member ; Jeffrey Thorne ; Committee Member ; Bruce Weir ; Committee Member
University:North Carolina State University
关键词: codon model;    synonymous and nonsynonymous substitution rates;    site-to-site rate heterogeneity;    molecular evolution;    gamma distribution;    rate correlation;   
Others  :  https://repository.lib.ncsu.edu/bitstream/handle/1840.16/4400/etd.pdf?sequence=1&isAllowed=y
美国|英语
来源: null
PDF
【 摘 要 】

The ability to realistically model gene evolution improved dramatically with the rejection of the assumption that rates are constant across sites. Rate heterogeneity models allow for better estimates of parameters and site specific inferences such as the detection of positive selection. Recently developed models of codon evolution allow for both synonymous and nonsynonymous rates to vary independently according to discretized gamma distributions. I applied this model to mitochondrial genomes and concluded that synonymous rate variation is present in many genes, and is of appreciable magnitude relative to the amount of nonsynonymous heterogeneity. I then extending this model to allow for the two rates to vary according to a dependent bivariate distribution, permitting tests for the significance of correlation of rates within a gene. I present here the algorithm to discretize this bivariate distribution and the application of the model to many real data sets. Significant correlation between synonymous and nonsynonymous rates exists in roughly half of the data sets that I examined, and the correlation is typically positive. These data sets range over a wide group of taxa and genes, implying that the trend of correlation is general. Finally, I performed a thorough investigation of the statistical properties of using discretized gamma distributions to model rate variation, looking at the bias and variance in parameter estimates. These discretized distributions are common in modeling heterogeneity, but have weaknesses that must be well understood before making inferences.

【 预 览 】
附件列表
Files Size Format View
Site-to-site Rate Variation in Protein Coding Genes 1827KB PDF download
  文献评价指标  
  下载次数:15次 浏览次数:38次