| BMC Bioinformatics | |
| DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood | |
| Tane Kim2  Weilong Hao1  | |
| [1] Department of Biological Sciences, Wayne State University, 48202 Detroit, USA | |
| [2] Mathematics Undergraduate Program, Wayne State University, 48202 Detroit, USA | |
| 关键词: Phylogeny; Maximum likelihood; Birth and death; Gene family evolution; Discrete character states; | |
| Others : 1085776 DOI : 10.1186/1471-2105-15-320 |
|
| received in 2014-07-26, accepted in 2014-09-25, 发布年份 2014 | |
PDF
|
|
【 摘 要 】
Background
The study of discrete characters is crucial for the understanding of evolutionary processes. Even though great advances have been made in the analysis of nucleotide sequences, computer programs for non-DNA discrete characters are often dedicated to specific analyses and lack flexibility. Discrete characters often have different transition rate matrices, variable rates among sites and sometimes contain unobservable states. To obtain the ability to accurately estimate a variety of discrete characters, programs with sophisticated methodologies and flexible settings are desired.
Results
DiscML performs maximum likelihood estimation for evolutionary rates of discrete characters on a provided phylogeny with the options that correct for unobservable data, rate variations, and unknown prior root probabilities from the empirical data. It gives users options to customize the instantaneous transition rate matrices, or to choose pre-determined matrices from models such as birth-and-death (BD), birth-death-and-innovation (BDI), equal rates (ER), symmetric (SYM), general time-reversible (GTR) and all rates different (ARD). Moreover, we show application examples of DiscML on gene family data and on intron presence/absence data.
Conclusion
DiscML was developed as a unified R program for estimating evolutionary rates of discrete characters with no restriction on the number of character states, and with flexibility to use different transition models. DiscML is ideal for the analyses of binary (1s/0s) patterns, multi-gene families, and multistate discrete morphological characteristics.
【 授权许可】
2014 Kim and Hao; licensee BioMed Central Ltd.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 20150113180440970.pdf | 390KB | ||
| Figure 3. | 66KB | Image | |
| Figure 4. | 19KB | Image | |
| Figure 1. | 50KB | Image |
【 图 表 】
Figure 1.
Figure 4.
Figure 3.
【 参考文献 】
- [1]Lewis PO: A likelihood approach to estimating phylogeny from discrete morphological character data. Syst Biol 2001, 50:913-915.
- [2]Csűrös M: Likely scenarios of intron evolution. In Comparative Genomics. Lecture Notes in Computer Science. Edited by McLysaght A, Huson DH. Berlin: Springer; 2005:47-60.
- [3]Hao W, Golding GB: The fate of laterally transferred genes: life in the fast lane to adaptation or death. Genome Res 2006, 16:636-643.
- [4]Hahn MW, De Bie T, Stajich JE, Nguyen C, Cristianini N: Estimating the tempo and mode of gene family evolution from comparative genomic data. Genome Res 2005, 15:1153-1160.
- [5]van Passel MW, Nijveen H, Wahl LM: Birth, death, and diversification of mobile promoters in prokaryotes. Genetics 2014, 197:291-299.
- [6]Schmitz RJ, Schultz MD, Urich MA, Nery JR, Pelizzola M, Libiger O, Alix A, McCosh RB, Chen H, Schork NJ, Ecker JR: Patterns of population epigenomic diversity. Nature 2013, 495:193-198.
- [7]Paradis E, Claude J, Strimmer K: APE: Analyses of Phylogenetics and Evolution in R language. Bioinformatics 2004, 20:289-290.
- [8]Pagel M, Meade A, Barker D: Bayesian estimation of ancestral character states on phylogenies. Syst Biol 2004, 53:673-674.
- [9]Maddison WP, Maddison DR: Mesquite: a modular system for evolutionary analysis. 2011. Version 2.75, [http://mesquiteproject.org webcite]
- [10]Cohen O, Ashkenazy H, Belinky F, Huchon D, Pupko T: GLOOME: gain loss mapping engine. Bioinformatics 2010, 26:2914-2915.
- [11]Csűrös M: Count: evolutionary analysis of phylogenetic profiles with parsimony and likelihood. Bioinformatics 2010, 26:1910-1912.
- [12]Liu L, Yu L, Kalavacharla V, Liu Z: A Bayesian model for gene family evolution. BMC Bioinformatics 2011, 12:426. BioMed Central Full Text
- [13]Librado P, Vieira FG, Rozas J: BadiRate: estimating family turnover rates by likelihood-based methods. Bioinformatics 2012, 28:279-281.
- [14]Han MV, Thomas GW, Lugo-Martinez J, Hahn MW: Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Mol Biol Evol 2013, 30:1987-1987.
- [15]Cohen O, Rubinstein ND, Stern A, Gophna U, Pupko T: A likelihood framework to analyse phyletic patterns. Philos Trans R Soc Lond B Biol Sci 2008, 363:3903-3911.
- [16]Hibbett DS: Trends in morphological evolution in homobasidiomycetes inferred using maximum likelihood: a comparison of binary and multistate approaches. Syst Biol 2004, 53:889-903.
- [17]Gay DM: Usage Summary for Selected Optimization Routines. Computing science technical report 153. Murray Hill: AT&T Bell Laboratories; 1990.
- [18]Yang Z: Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods. J Mol Evol 1994, 39:306-314.
- [19]Yap VB, Speed T: Rooting a phylogenetic tree with nonreversible substitution models. BMC Evol Biol 2005, 5:2. BioMed Central Full Text
- [20]Hao W, Golding GB: Inferring bacterial genome flux while considering truncated genes. Genetics 2010, 186:411-426.
- [21]Waddell PJ, Steel MA: General time-reversible distances with unequal rates across sites mixing gamma and inverse Gaussian distributions with invariant sites. Mol Phylogenet Evol 1997, 8:398-414.
- [22]Felsenstein J: Inferring Phylogenies. Sunderland: Sinauer Associates, Inc.; 2004.
- [23]Boussau B, Gouy M: Efficient likelihood computations with nonreversible models of evolution. Syst Biol 2006, 55:756-758.
- [24]Felsenstein J: Phylogenies from restriction sites: a maximum- likelihood approach. Evolution 1992, 46:159-173.
- [25]Nelder JA, Mead R: A simplex method for function minimization. Comput J 1965, 7:308-313.
- [26]Hao W, Golding GB: Uncovering rate variation of lateral gene transfer during bacterial genome evolution. BMC Genomics 2008, 9:235. BioMed Central Full Text
- [27]Lopez P, Casane D, Philippe H: Heterotachy, an important process of protein evolution. Mol Biol Evol 2002, 19:1-7.
- [28]Wang HC, Spencer M, Susko E, Roger AJ: Testing for covarion-like evolution in protein sequences. Mol Biol Evol 2007, 24:294-305.
- [29]Spencer M, Sangaralingam A: A phylogenetic mixture model for gene family loss in parasitic bacteria. Mol Biol Evol 2009, 26:1901-1908.
- [30]Hao W, Golding GB: Patterns of bacterial gene movement. Mol Biol Evol 2004, 21:1294-1307.
- [31]Wu B, Hao W: Horizontal transfer and gene conversion as an important driving force in shaping the landscape of mitochondrial introns. G3 (Bethesda) 2014, 4:605-612.
PDF