期刊论文详细信息
BMC Bioinformatics
Does the choice of nucleotide substitution models matter topologically?
Research Article
Michael Hoff1  Stefan Orf1  Benedikt Riehm1  Alexandros Stamatakis2  Diego Darriba3 
[1] Karlsruhe Institute of Technology, Department of Informatics, Kaiserstraße 12, 76131, Karlsruhe, Germany;Karlsruhe Institute of Technology, Department of Informatics, Kaiserstraße 12, 76131, Karlsruhe, Germany;The Exelixis Lab, Scientific Computing Group, Heidelberg Institute for Theoretical Studies, Schloss-Wolfsbrunnenweg 35, 69118, Heidelberg, Germany;The Exelixis Lab, Scientific Computing Group, Heidelberg Institute for Theoretical Studies, Schloss-Wolfsbrunnenweg 35, 69118, Heidelberg, Germany;
关键词: Phylogenetics;    Nucleotide substitution;    Model selection;    Information criterion;    BIC;    AIC;   
DOI  :  10.1186/s12859-016-0985-x
 received in 2016-02-26, accepted in 2016-03-09,  发布年份 2016
来源: Springer
PDF
【 摘 要 】

BackgroundIn the context of a master level programming practical at the computer science department of the Karlsruhe Institute of Technology, we developed and make available an open-source code for testing all 203 possible nucleotide substitution models in the Maximum Likelihood (ML) setting under the common Akaike, corrected Akaike, and Bayesian information criteria. We address the question if model selection matters topologically, that is, if conducting ML inferences under the optimal, instead of a standard General Time Reversible model, yields different tree topologies. We also assess, to which degree models selected and trees inferred under the three standard criteria (AIC, AICc, BIC) differ. Finally, we assess if the definition of the sample size (#sites versus #sites × #taxa) yields different models and, as a consequence, different tree topologies.ResultsWe find that, all three factors (by order of impact: nucleotide model selection, information criterion used, sample size definition) can yield topologically substantially different final tree topologies (topological difference exceeding 10 %) for approximately 5 % of the tree inferences conducted on the 39 empirical datasets used in our study.ConclusionsWe find that, using the best-fit nucleotide substitution model may change the final ML tree topology compared to an inference under a default GTR model. The effect is less pronounced when comparing distinct information criteria. Nonetheless, in some cases we did obtain substantial topological differences.

【 授权许可】

CC BY   
© Hoffet al. 2016

【 预 览 】
附件列表
Files Size Format View
RO202311104021302ZK.pdf 954KB PDF download
Fig. 1: The conceptual framework for adherence to treatment guidelines in private drug outlets in Kisumu, Kenya 398KB Image download
MediaObjects/12888_2023_5220_MOESM1_ESM.docx 69KB Other download
Fig. 2 1904KB Image download
12951_2016_246_Article_IEq5.gif 1KB Image download
【 图 表 】

12951_2016_246_Article_IEq5.gif

Fig. 2

Fig. 1: The conceptual framework for adherence to treatment guidelines in private drug outlets in Kisumu, Kenya

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  文献评价指标  
  下载次数:2次 浏览次数:1次