Journal article details
BMC Bioinformatics
A fast method for calculating reliable event supports in tree reconciliations via Pareto optimality
Research Article
Thu-Hien To [1], Edwin Jacox [1], Celine Scornavacca [1], Vincent Ranwez [2]
[1] ISEM - Université de Montpellier, CNRS, IRD, EPHE, Place Eugène Bataillon, 34392, Montpellier, France; Institut de Biologie Computationnelle (IBC), 34095, Montpellier, France
[2] Montpellier SupAgro, UMR AGAP, 2 Place P. Viala, 34060, Montpellier, France; Institut de Biologie Computationnelle (IBC), 34095, Montpellier, France
Keywords: Tree reconciliation; Gene evolution; Phylogenetics; Parsimony; Supports
DOI: 10.1186/s12859-015-0803-x
Received 2015-05-03; accepted 2015-10-26; published 2015
Source: Springer
【 Abstract 】

Background: Given a gene tree and a species tree, reconciliation methods attempt to retrieve the macro-evolutionary events that best explain the discrepancies between the two tree topologies. The DTL parsimonious approach searches for a most parsimonious reconciliation between a gene tree and a (dated) species tree, considering four possible macro-evolutionary events (speciation, duplication, transfer, and loss) with specific costs. Unfortunately, many events are erroneously predicted because of errors in the input trees, inappropriate input cost values, or the existence of several equally parsimonious scenarios. It is thus crucial to provide a measure of reliability for predicted events. It has recently been proposed that the reliability of an event can be estimated via its frequency in the set of most parsimonious reconciliations obtained using a variety of reasonable input cost vectors. To compute such a support, a straightforward but time-consuming approach is to generate cost vectors that depart slightly from the original ones, independently compute the set of all most parsimonious reconciliations for each vector, and combine these sets a posteriori. Another proposed approach uses Pareto optimality to partition cost values into regions that induce reconciliations with the same number of DTL events. The support of an event is then defined as its frequency in the set of regions. However, the number of regions is often not large enough to provide reliable supports.

Results: We present a method to efficiently compute event supports via a polynomial-sized graph, which can represent all reconciliations for several different costs. Moreover, two methods are proposed to take alternative input costs into account: either explicitly providing an input cost range, or allowing a tolerance on how far the cost of a reconciliation may exceed the optimal cost. Our methods are faster than the region-based method, substantially faster than the sampling-costs approach, and have a higher event-prediction accuracy on simulated data.

Conclusions: We propose a new approach that improves the accuracy of event supports for parsimonious reconciliation methods by accounting for uncertainty in the input costs. Furthermore, because of their speed, our methods can be used on large gene families. Our algorithms are implemented in the ecceTERA program, freely available from http://mbb.univ-montp2.fr/MBB/.
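
To make the notion of event support concrete, here is a minimal Python sketch of the straightforward sampling-costs idea described in the Background: vary the cost vector around the original one, collect the most parsimonious reconciliations for each vector, pool them, and take the frequency of each event. It is an illustration under stated assumptions, not the graph-based method implemented in ecceTERA. The candidate reconciliations, the event labels, and the function names (`cost`, `most_parsimonious`, `cost_grid`, `event_supports`) are all hypothetical, and the candidates are given by hand rather than computed from trees.

```python
from collections import Counter
from itertools import product

# Hypothetical toy data (not from the paper): each candidate reconciliation
# is a set of labelled events plus its (duplication, transfer, loss) counts.
CANDIDATES = [
    {"events": {"S:g1@A", "D:g2@B", "L:g3@C"}, "counts": (1, 0, 1)},
    {"events": {"S:g1@A", "T:g2@B->C"}, "counts": (0, 1, 0)},
    {"events": {"S:g1@A", "D:g2@B", "D:g4@A", "L:g3@C"}, "counts": (2, 0, 1)},
]


def cost(counts, costs):
    """Total cost of a reconciliation: sum of event counts times unit costs."""
    return sum(n * c for n, c in zip(counts, costs))


def most_parsimonious(candidates, costs):
    """Return every candidate whose cost is minimal for this cost vector."""
    best = min(cost(c["counts"], costs) for c in candidates)
    return [c for c in candidates if cost(c["counts"], costs) == best]


def cost_grid(base, delta):
    """Cost vectors departing from `base` by -delta, 0 or +delta per event type."""
    return list(product(*[(c - delta, c, c + delta) for c in base]))


def event_supports(candidates, cost_vectors):
    """Frequency of each event among the pooled most parsimonious reconciliations."""
    pooled = []
    for costs in cost_vectors:
        pooled.extend(most_parsimonious(candidates, costs))
    tally = Counter(event for rec in pooled for event in rec["events"])
    return {event: n / len(pooled) for event, n in tally.items()}


if __name__ == "__main__":
    # Integer (duplication, transfer, loss) costs chosen only for illustration.
    supports = event_supports(CANDIDATES, cost_grid((2, 3, 1), 1))
    for event, support in sorted(supports.items()):
        print(f"{event}: {support:.2f}")
```

In this toy setting an event that appears in every optimal reconciliation for every sampled cost vector gets support 1, while an event that only occurs in a never-optimal candidate gets no support at all. The cost of this naive scheme grows with the number of sampled vectors, which is the inefficiency the abstract says the graph-based method avoids.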

【 License 】

CC BY   
© To et al. 2015
