期刊论文详细信息
BMC Genomics
Comparative pangenomics: analysis of 12 microbial pathogen pangenomes reveals conserved global structures of genetic and functional diversity
Jason C. Hyun1  Jonathan M. Monk2  Bernhard O. Palsson2 
[1] Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA, USA;Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA;
关键词: Pangenome;    Core genome;    Comparative genomics;    Multispecies;    Heaps’ law;    Functional diversity;    Sequence diversity;    Protein domains;    Aminoacyl-tRNA synthetases;   
DOI  :  10.1186/s12864-021-08223-8
来源: Springer
PDF
【 摘 要 】

BackgroundWith the exponential growth of publicly available genome sequences, pangenome analyses have provided increasingly complete pictures of genetic diversity for many microbial species. However, relatively few studies have scaled beyond single pangenomes to compare global genetic diversity both within and across different species. We present here several methods for “comparative pangenomics” that can be used to contextualize multi-pangenome scale genetic diversity with gene function for multiple species at multiple resolutions: pangenome shape, genes, sequence variants, and positions within variants.ResultsApplied to 12,676 genomes across 12 microbial pathogenic species, we observed several shared resolution-specific patterns of genetic diversity: First, pangenome openness is associated with species’ phylogenetic placement. Second, relationships between gene function and frequency are conserved across species, with core genomes enriched for metabolic and ribosomal genes and accessory genomes for trafficking, secretion, and defense-associated genes. Third, genes in core genomes with the highest sequence diversity are functionally diverse. Finally, certain protein domains are consistently mutation enriched across multiple species, especially among aminoacyl-tRNA synthetases where the extent of a domain’s mutation enrichment is strongly function-dependent.ConclusionsThese results illustrate the value of each resolution at uncovering distinct aspects in the relationship between genetic and functional diversity across multiple species. With the continued growth of the number of sequenced genomes, these methods will reveal additional universal patterns of genetic diversity at the pangenome scale.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202203111954786ZK.pdf 8013KB PDF download
  文献评价指标  
  下载次数:10次 浏览次数:11次