Journal Article Details
Virology Journal
Analysis of 3.5 million SARS-CoV-2 sequences reveals unique mutational trends with consistent nucleotide and codon frequencies
Research
Anton A. Komar [1], Haim Bar [2], Nigam H. Padhiar [3], Chava Kimchi-Sarfaty [3], Sarah E. Fumagalli [3], Douglas Meyer [3], Upendra Katneni [3], Michael DiCuccio [4]
[1] Department of Biological, Geological and Environmental Sciences, Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH, USA
[2] Department of Statistics, University of Connecticut, Storrs, CT, USA
[3] Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
[4] Rockville, USA
Keywords: SARS-CoV-2; Nucleotide usage; Codon usage bias; Relative synonymous codon usage; Codon adaptation index and dN/dS
DOI: 10.1186/s12985-023-01982-8
Received 2022-12-13, accepted 2023-02-02, published 2023
Source: Springer
【 Abstract 】

Background: Since the onset of the SARS-CoV-2 pandemic, bioinformatic analyses have been performed to understand the nucleotide and synonymous codon usage features and mutational patterns of the virus. However, comparatively few have attempted to perform such analyses on a considerably large cohort of viral genomes while organizing the plethora of available sequence data for a month-by-month analysis to observe changes over time. Here, we aimed to perform sequence composition and mutation analysis of SARS-CoV-2, separating sequences by gene, clade, and timepoints, and contrast the mutational profile of SARS-CoV-2 to other comparable RNA viruses.

Methods: Using a cleaned, filtered, and pre-aligned dataset of over 3.5 million sequences downloaded from the GISAID database, we computed nucleotide and codon usage statistics, including calculation of relative synonymous codon usage values. We then calculated codon adaptation index (CAI) changes and a nonsynonymous/synonymous mutation ratio (dN/dS) over time for our dataset. Finally, we compiled information on the types of mutations occurring for SARS-CoV-2 and other comparable RNA viruses, and generated heatmaps showing codon and nucleotide composition at high entropy positions along the Spike sequence.

Results: We show that nucleotide and codon usage metrics remain relatively consistent over the 32-month span, though there are significant differences between clades within each gene at various timepoints. CAI and dN/dS values vary substantially between different timepoints and different genes, with Spike gene on average showing both the highest CAI and dN/dS values. Mutational analysis showed that SARS-CoV-2 Spike has a higher proportion of nonsynonymous mutations than analogous genes in other RNA viruses, with nonsynonymous mutations outnumbering synonymous ones by up to 20:1. However, at several specific positions, synonymous mutations were overwhelmingly predominant.

Conclusions: Our multifaceted analysis covering both the composition and mutation signature of SARS-CoV-2 gives valuable insight into the nucleotide frequency and codon usage heterogeneity of SARS-CoV-2 over time, and its unique mutational profile compared to other RNA viruses.
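
For readers unfamiliar with the relative synonymous codon usage (RSCU) statistic named in the Methods, the following is a minimal sketch of the standard definition: the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. It is not the authors' pipeline. The function name `rscu`, the choice to skip stop codons and single-codon amino acids (Met, Trp), and the toy input sequence are assumptions made for illustration only.

```python
from collections import Counter, defaultdict

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def rscu(seq):
    """Return {codon: RSCU} for one in-frame coding sequence (hypothetical helper).

    RSCU = observed codon count * (number of synonymous codons) / (family total).
    Stop codons and single-codon amino acids are skipped (an assumption here).
    """
    seq = seq.upper().replace("U", "T")
    counts = Counter()
    # Walk the sequence codon by codon, ignoring a trailing partial codon
    # and any codon containing ambiguous bases such as N.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if codon in CODON_TABLE:
            counts[codon] += 1

    # Group codons into synonymous families by amino acid.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        families[aa].append(codon)

    values = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:
            continue
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for c in family:
            values[c] = counts[c] * len(family) / total
    return values


# Toy example: four alanine codons (GCT twice, GCC once, GCA once).
example = rscu("GCTGCTGCCGCA")
```

An RSCU of 1 means a codon is used as often as expected under equal usage, values above 1 indicate over-representation, and values below 1 indicate under-representation. In the toy example, `GCT` accounts for two of four alanine codons across a four-codon family, so its RSCU is 2.0 and `GCG`, which never appears, has an RSCU of 0.0.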

【 License 】

CC BY   
© This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2023
