BMC Bioinformatics | |
EmpPrior: using outside empirical data to inform branch-length priors for Bayesian phylogenetics | |
Software | |
Jeremy M. Brown1  Bradley J. Nelson2  John J. Andersen3  | |
[1] Department of Biological Sciences and Museum of Natural Science, Louisiana State University, 202 Life Sciences Building, 70803, Baton Rouge, LA, USA;Department of Biological Sciences and Museum of Natural Science, Louisiana State University, 202 Life Sciences Building, 70803, Baton Rouge, LA, USA;Present Address: Department of Genome Sciences, University of Washington, Foege Building S-250, 98195, Seattle, WA, USA;Department of Biological Sciences and Museum of Natural Science, Louisiana State University, 202 Life Sciences Building, 70803, Baton Rouge, LA, USA;Present Address: IBM, 11501 Burnet Rd., 78758, Austin, TX, USA; | |
关键词: Bayesian phylogenetics; Informed priors; Branch lengths; | |
DOI : 10.1186/s12859-016-1132-4 | |
received in 2016-02-16, accepted in 2016-06-09, 发布年份 2016 | |
来源: Springer | |
【 摘 要 】
BackgroundBranch-length parameters are a central component of phylogenetic models and of intrinsic biological interest. Default branch-length priors in some Bayesian phylogenetic software can be unintentionally informative and lead to branch- and tree-length estimates that are unreasonable. Alternatively, priors may be uninformative, but lead to diffuse posterior estimates. Despite the widespread availability of relevant datasets from other groups, biologists rarely leverage outside information to specify branch-length priors that are specific to the analysis they are conducting.ResultsWe developed the software package EmpPrior to facilitate the collection and incorporation of relevant, outside information when setting branch-length priors for phylogenetics. EmpPrior efficiently queries TreeBASE to find data that are similar to focal data, in terms of taxonomic and genetic sampling, and uses them to inform branch-length priors for the focal analysis. EmpPrior consists of two components: EmpPrior-search, written in Java to query TreeBASE, and EmpPrior-fit, written in R to parameterize branch-length distributions. In an example analysis, we show how the use of relevant, outside data is made possible by EmpPrior and improves tree-length estimates from a focal dataset.ConclusionEmpPrior is easy to use, fast, and improves both the accuracy and precision of branch-length estimates in many circumstances. While EmpPrior’s focus is on branch lengths, the strategy it employs could easily be extended to address other prior parameterization problems in phylogenetics.
【 授权许可】
CC BY
© The Author(s). 2016
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311096074821ZK.pdf | 868KB | download |
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]