PeerJ | |
The synergistic effect of concatenation in phylogenomics: the case in Pantoea | |
article | |
Marike Palmer1  Stephanus N. Venter1  Alistair R. McTaggart1  Martin P.A. Coetzee1  Stephanie Van Wyk1  Juanita R. Avontuur1  Chrizelle W. Beukes1  Gerda Fourie1  Quentin C. Santana1  Magriet A. Van Der Nest1  Jochen Blom3  Emma T. Steenkamp1  | |
[1] Department of Biochemistry, Genetics and Microbiology, DST-NRF Centre of Excellence in Tree Health Biotechnology ,(CTHB) and Forestry and Agricultural Biotechnology Institute ,(FABI), University of Pretoria;Queensland Alliance for Agriculture and Food Innovation, University of Queensland;Bioinformatics and Systems Biology, Justus Liebig Universität Gießen | |
关键词: Phylogenomics; Concatenate; Super trees; Phylogenetics; Phylogenetic signal; Phylogenetic conflict; | |
DOI : 10.7717/peerj.6698 | |
学科分类:社会科学、人文和艺术(综合) | |
来源: Inra | |
【 摘 要 】
With the increased availability of genome sequences for bacteria, it has become routine practice to construct genome-based phylogenies. These phylogenies have formed the basis for various taxonomic decisions, especially for resolving problematic relationships between taxa. Despite the popularity of concatenating shared genes to obtain well-supported phylogenies, various issues regarding this combined-evidence approach have been raised. These include the introduction of phylogenetic error into datasets, as well as incongruence due to organism-level evolutionary processes, particularly horizontal gene transfer and incomplete lineage sorting. Because of the huge effect that this could have on phylogenies, we evaluated the impact of phylogenetic conflict caused by organism-level evolutionary processes on the established species phylogeny for Pantoea, a member of the Enterobacterales. We explored the presence and distribution of phylogenetic conflict at the gene partition and nucleotide levels, by identifying putative inter-lineage recombination events that might have contributed to such conflict. Furthermore, we determined whether smaller, randomly constructed datasets had sufficient signal to reconstruct the current species tree hypothesis or if they would be overshadowed by phylogenetic incongruence. We found that no individual gene tree was fully congruent with the species phylogeny of Pantoea, although many of the expected nodes were supported by various individual genes across the genome. Evidence of recombination was found across all lineages within Pantoea, and provides support for organism-level evolutionary processes as a potential source of phylogenetic conflict. The phylogenetic signal from at least 70 random genes recovered robust, well-supported phylogenies for the backbone and most species relationships of Pantoea, and was unaffected by phylogenetic conflict within the dataset. Furthermore, despite providing limited resolution among taxa at the level of single gene trees, concatenated analyses of genes that were identified as having no signal resulted in a phylogeny that resembled the species phylogeny of Pantoea. This distribution of signal and noise across the genome presents the ideal situation for phylogenetic inference, as the topology from a ≥70-gene concatenated species phylogeny is not driven by single genes, and our data suggests that this finding may also hold true for smaller datasets. We thus argue that, by using a concatenation-based approach in phylogenomics, one can obtain robust phylogenies due to the synergistic effect of the combined signal obtained from multiple genes.
【 授权许可】
CC BY
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202307100010610ZK.pdf | 6157KB | download |