期刊论文详细信息
BMC Evolutionary Biology
Evolution of complete proteomes: guanine-cytosine pressure, phylogeny and environmental influences blend the proteomic architecture
Fusheng Chen1  Yanchun Shao1  Wanping Chen1 
[1] College of Food Science and Technology, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
关键词: Environmental influence;    Phylogeny;    GC pressure;    Evolution;    Proteome;   
Others  :  1085737
DOI  :  10.1186/1471-2148-13-219
 received in 2013-03-21, accepted in 2013-10-01,  发布年份 2013
PDF
【 摘 要 】

Background

Guanine-cytosine (GC) composition is an important feature of genomes. Likewise, amino acid composition is a distinct, but less valued, feature of proteomes. A major concern is that it is not clear what valuable information can be acquired from amino acid composition data. To address this concern, in-depth analyses of the amino acid composition of the complete proteomes from 63 archaea, 270 bacteria, and 128 eukaryotes were performed.

Results

Principal component analysis of the amino acid matrices showed that the main contributors to proteomic architecture were genomic GC variation, phylogeny, and environmental influences. GC pressure drove positive selection on Ala, Arg, Gly, Pro, Trp, and Val, and adverse selection on Asn, Lys, Ile, Phe, and Tyr. The physico-chemical framework of the complete proteomes withstood GC pressure by frequency complementation of GC-dependent amino acid pairs with similar physico-chemical properties. Gln, His, Ser, and Val were responsible for phylogeny and their constituted components could differentiate archaea, bacteria, and eukaryotes. Environmental niche was also a significant factor in determining proteomic architecture, especially for archaea for which the main amino acids were Cys, Leu, and Thr. In archaea, hyperthermophiles, acidophiles, mesophiles, psychrophiles, and halophiles gathered successively along the environment-based principal component. Concordance between proteomic architecture and the genetic code was also related closely to genomic GC content, phylogeny, and lifestyles.

Conclusions

Large-scale analyses of the complete proteomes of a wide range of organisms suggested that amino acid composition retained the trace of GC variation, phylogeny, and environmental influences during evolution. The findings from this study will help in the development of a global understanding of proteome evolution, and even biological evolution.

【 授权许可】

   
2013 Chen et al.; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20150113180022311.pdf 3088KB PDF download
Figure 8. 94KB Image download
Figure 7. 59KB Image download
Figure 6. 43KB Image download
Figure 5. 135KB Image download
Figure 4. 85KB Image download
Figure 3. 58KB Image download
Figure 2. 78KB Image download
Figure 1. 116KB Image download
【 图 表 】

Figure 1.

Figure 2.

Figure 3.

Figure 4.

Figure 5.

Figure 6.

Figure 7.

Figure 8.

【 参考文献 】
  • [1]Casto AM, Amid C: Beyond the Genome: genomics research ten years after the human genome sequence. Genome Biol 2010, 11:309. BioMed Central Full Text
  • [2]Yao T: Bioinformatics for the genomic sciences and towards systems biology. Japanese activities in the post-genome era. Prog Biophys Mol Biol 2002, 80:23-42.
  • [3]Tekaia F, Yeramian E, Dujon B: Amino acid composition of genomes, lifestyles of organisms, and evolutionary trends: a global picture with correspondence analysis. Gene 2002, 297:51-60.
  • [4]Tekaia F, Yeramian E: Evolution of proteomes: fundamental signatures and global trends in amino acid compositions. BMC Genomics 2006, 7:307. BioMed Central Full Text
  • [5]Smole Z, Nikolic N, Supek F, Smuc T, Sbalzarini IF, Krisko A: Proteome sequence features carry signatures of the environmental niche of prokaryotes. BMC Evol Biol 2011, 11:26. BioMed Central Full Text
  • [6]Schmidt A, Rzanny M, Hagen M, Schutze E, Kothe E: GC content-independent amino acid patterns in Bacteria and Archaea. J Basic Microbiol 2012, 52:195-205.
  • [7]Vieira-Silva S, Rocha EP: An assessment of the impacts of molecular oxygen on the evolution of proteomes. Mol Biol Evol 2008, 25:1931-1942.
  • [8]Sueoka N: Correlation between base composition of deoxyribonucleic acid and amino acid composition of protein. Proc Natl Acad Sci USA 1961, 47:1141-1149.
  • [9]Lobry JR: Influence of genomic G + C content on average amino-acid composition of proteins from 59 bacterial species. Gene 1997, 205:309-316.
  • [10]Chen W, Xie T, Shao Y, Chen F: Genomic characteristics comparisons of 12 food-related filamentous fungi in tRNA gene set, codon usage and amino acid composition. Gene 2012, 497:116-124.
  • [11]Bharanidharan D, Bhargavi GR, Uthanumallian K, Gautham N: Correlations between nucleotide frequencies and amino acid composition in 115 bacterial species. Biochem Bioph Res Co 2004, 315:1097-1103.
  • [12]Lightfield J, Fram NR, Ely B: Across bacterial phyla, distantly-related genomes with similar genomic GC content have similar patterns of amino acid usage. PLoS One 2011, 6:e17677.
  • [13]Singer GAC, Hickey DA: Nucleotide bias causes a genomewide bias in the amino acid composition of proteins. Mol Biol Evol 2000, 17:1581-1588.
  • [14]Knight RD, Freeland SJ, Landweber LF: A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes. Genome Biol 2001, 2:0010.
  • [15]Nishio Y, Nakamura Y, Kawarabayasi Y, Usuda Y, Kimura E, Sugimoto S, Matsui K, Yamagishi A, Kikuchi H, Ikeo K, Gojobori T: Comparative complete genome sequence analysis of the amino acid replacements responsible for the thermostability of Corynebacterium efficiens. Genome Res 2003, 13:1572-1579.
  • [16]Pe'er I, Felder CE, Man O, Silman I, Sussman JL, Beckmann JS: Proteomic signatures: Amino acid and oligopeptide compositions differentiate among phyla. Proteins 2004, 54:20-40.
  • [17]Dumontier M, Michalickova K, Hogue CWV: Species-specific protein sequence and fold optimizations. BMC Bioinforma 2002, 3:39. BioMed Central Full Text
  • [18]Nikolic N, Smole Z, Krisko A: Proteomic properties reveal phyloecological clusters of Archaea. PLoS One 2012, 7:e48231.
  • [19]Hickey DA, Singer GAC: Genomic and proteomic adaptations to growth at high temperature. Genome Biol 2004, 5:117. BioMed Central Full Text
  • [20]Fukuchi S, Yoshimune K, Wakayama M, Moriguchi M, Nishikawa K: Unique amino acid composition of proteins in halophilic bacteria. J Mol Biol 2003, 327:347-357.
  • [21]Bardavid RE, Oren A: The amino acid composition of proteins from anaerobic halophilic bacteria of the order Halanaerobiales. Extremophiles 2012, 16:567-572.
  • [22]Brocchieri L: Environmental signatures in proteome properties. Proc Natl Acad Sci U S A 2004, 101:8257-8258.
  • [23]Singer GAC, Hickey DA: Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content. Gene 2003, 317:39-47.
  • [24]Puigbo P, Pasamontes A, Garcia-Vallve S: Gaining and losing the thermophilic adaptation in prokaryotes. Trends Genet 2008, 24:10-14.
  • [25]Zeldovich KB, Berezovsky IN, Shakhnovich EI: Protein and DNA sequence determinants of thermophilic adaptation. PLoS Comput Biol 2007, 3:e5.
  • [26]Wang GZ, Lercher MJ: Amino acid composition in endothermic vertebrates is biased in the same direction as in thermophilic prokaryotes. BMC Evol Biol 2010, 10:263. BioMed Central Full Text
  • [27]Jobson RW, Qiu YL: Amino acid compositional shifts during streptophyte transitions to terrestrial habitats. J Mol Evol 2011, 72:204-214.
  • [28]Hanai R, Wada A: The effects of guanine and cytosine variation on dinucleotide frequency and amino acid composition in the human genome. J Mol Evol 1988, 27:321-325.
  • [29]Karlin S, Burge C: Dinucleotide relative abundance extremes - a genomic signature. Trends Genet 1995, 11:283-290.
  • [30]De Amicis F, Marchetti S: Intercodon dinucleotides affect codon choice in plant genes. Nucleic Acids Res 2000, 28:3339-3345.
  • [31]Gentles AJ, Karlin S: Genome-scale compositional comparisons in eukaryotes. Genome Res 2001, 11:540-546.
  • [32]Alves R, Savageau MA: Evidence of selection for low cognate amino acid bias in amino acid biosynthetic enzymes. Mol Microbiol 2005, 56:1017-1034.
  • [33]Bragg JG, Thomas D, Baudouin-Cornu P: Variation among species in proteomic sulphur content is related to environmental conditions. Proc R Soc B 2006, 273:1293-1300.
  • [34]Akashi H, Gojobori T: Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis. Proc Natl Acad Sci USA 2002, 99:3695-3700.
  • [35]Jun T, Robert N, Erika W, Erin M, Brad M, Grant H, Tygue A, Kelly H, Rachel H, Manuel R, Mike W: The frequencies of amino acids encoded by genomes that utilize standard and nonstandard genetic codes. BIOS 2010, 81:22-31.
  • [36]Clancy S, Brown W: Translation: DNA to mRNA to protein. Nat Educ 2008, 1:1.
  • [37]Wu HL, Bagby S, van den Elsen JMH: Evolution of the genetic triplet code via two types of doublet codons. J Mol Evol 2005, 61:54-64.
  • [38]Tlusty T: A colorful origin for the genetic code: Information theory, statistical mechanics and the emergence of molecular codes. Phys Life Rev 2010, 7:362-376.
  • [39]Gilis D, Massar S, Cerf NJ, Rooman M: Optimality of the genetic code with respect to protein stability and amino-acid frequencies. Genome Biol 2001, 2:0049.
  • [40]Galtier N, Lobry JR: Relationships between genomic G + C content, RNA secondary structures, and optimal growth temperature in prokaryotes. J Mol Evol 1997, 44:632-636.
  • [41]Hurst LD, Merchant AR: High guanine–cytosine content is not an adaptation to high temperature: a comparative analysis amongst prokaryotes. Proc R Soc B 2001, 268:493-497.
  • [42]Smith DR, Chapman MR: Economical evolution: Microbes reduce the synthetic cost of extracellular proteins. mBio 2010, 1:e00131-10.
  • [43]Heizer EM, Raiford DW, Raymer ML, Doom TE, Miller RV, Krane DE: Amino acid cost and codon-usage biases in 6 prokaryotic genomes: A whole-genome analysis. Mol Biol Evol 2006, 23:1670-1680.
  • [44]Raiford DW, Heizer EM, Miller RV, Doom TE, Raymer ML, Krane DE: Metabolic and translational efficiency in microbial organisms. J Mol Evol 2012, 74:206-216.
  • [45]Raiford DW, Heizer EM, Miller RV, Akashi H, Raymer ML, Krane DE: Do amino acid biosynthetic costs constrain protein evolution in Saccharomyces cerevisiae? J Mol Evol 2008, 67:621-630.
  • [46]Barton MD, Delneri D, Oliver SG, Rattray M, Bergman CM: Evolutionary systems biology of amino acid biosynthetic cost in yeast. PLoS One 2010, 5:e11935.
  • [47]Seligmann H: Cost-minimization of amino acid usage. J Mol Evol 2003, 56:151-161.
  • [48]Wang J, Zhu S, Xu C: Amino acids. 3rd edition. Edited by Xu C. Beijing: Higher Education Press; 2002:124-127. [Biochemistry, Volume 1]
  • [49]Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 2011, 28:2731-2739.
  • [50]Letunic I, Bork P: Interactive Tree Of Life v2: online annotation and display of phylogenetic trees made easy. Nucleic Acids Res 2011, 39:W475-W478.
  文献评价指标  
  下载次数:141次 浏览次数:9次