| BMC Research Notes | |
| Implication of the cause of differences in 3D structures of proteins with high sequence identity based on analyses of amino acid sequences and 3D structures | |
| Takeshi Kikuchi2  Masatake Sugita2  Masanari Matsuoka1  | |
| [1] Japan Society for the Promotion of Science (JSPS) Research Fellow DC2, Tokyo, Japan;Department of Bioinformatics, College of Life Sciences, Ritsumeikan University, 1-1-1 Nojihigashi, Kusatsu, Shiga, Japan | |
| 关键词: Protein G; Protein A; IgG binding protein; Sequence tendency; Conservation analysis; Inter-residue average distance statistics; Sequence analysis; Chameleon sequence; Artificial homologues; | |
| Others : 1129378 DOI : 10.1186/1756-0500-7-654 |
|
| received in 2014-07-29, accepted in 2014-09-05, 发布年份 2014 | |
PDF
|
|
【 摘 要 】
Background
Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem.
Results
It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation.
Conclusions
The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
【 授权许可】
2014 Matsuoka et al.; licensee BioMed Central Ltd.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 20150226040533335.pdf | 2278KB | ||
| Figure 12. | 20KB | Image | |
| Figure 11. | 43KB | Image | |
| Figure 10. | 67KB | Image | |
| Figure 9. | 207KB | Image | |
| Figure 8. | 45KB | Image | |
| Figure 7. | 27KB | Image | |
| Figure 6. | 34KB | Image | |
| Figure 5. | 43KB | Image | |
| Figure 4. | 71KB | Image | |
| Figure 3. | 43KB | Image | |
| Figure 2. | 42KB | Image | |
| Figure 1. | 56KB | Image |
【 图 表 】
Figure 1.
Figure 2.
Figure 3.
Figure 4.
Figure 5.
Figure 6.
Figure 7.
Figure 8.
Figure 9.
Figure 10.
Figure 11.
Figure 12.
【 参考文献 】
- [1]Rost B: Twilight zone of protein sequence alignments. Protein Eng 1999, 12:85-94.
- [2]He Y, Rozak DA, Sari N, Chen Y, Bryan P, Orban J: Structure, dynamics, and stability variation in bacterial albumin binding modules: implications for species specificity. Biochemistry 2006, 45:10102-10109.
- [3]He Y, Chen Y, Alexander PA, Bryan PN, Orban J: Mutational tipping points for switching protein folds and functions. Structure 2012, 20:283-291.
- [4]Shen Y, Bryan PN, He Y, Orban J, Baker D, Bax A: De novo structure generation using chemical shifts for proteins with high-sequence identity but different folds. Protein Sci 2010, 19:349-356.
- [5]Allison JR, Bergeler M, Hansen N, van Gunsteren WF: Current computer modeling cannot explain Why Two highly similar sequences fold into different structures. Biochemistry 2011, 50:10965-10973.
- [6]Kikuchi T, Némethy G, Scheraga HA: Prediction of the location of structural domains in globular proteins. J Protein Chem 1988, 7:427-471.
- [7]Ichimaru T, Kikuchi T: Analysis of the differences in the folding kinetics of structurally homologous proteins based on predictions of the gross features of residue contacts. Proteins 2003, 51:515-530.
- [8]Nakajima S, Alvarez-Salgado E, Kikuchi T, Arredondo-Peter R: Prediction of folding pathway and kinetics among plant hemoglobins using an average distance map method. Proteins 2005, 61:500-506.
- [9]Nakajima S, Kikuchi T: Analysis of the differences in the folding mechanisms of c-type lysozymes based on contact maps constructed with interresidue average distances. J Mol Model 2007, 13:587-594.
- [10]Ishizuka Y, Kikuchi T: Analysis of the local sequences of folding sites in sandwich proteins with inter-residue average distance statistics. Open Bioinf J 2011, 5:59-68.
- [11]Kikuchi T: Analysis of 3D structural differences in the IgG-binding domains based on the interresidue average-distance statistics. Amino Acids 2008, 35:541-549.
- [12]Matsuoka M, Kikuchi T: Sequence analysis on the information of folding initiation segments in ferredoxin-like fold proteins. BMC Struct Biol 2014, 14:15.
- [13]Ptitsyn OB: Protein folding and protein evolution: common folding nucleus in different subfamilies of c-type cytochromes? J Mol Biol 1998, 278:655-666.
- [14]Mirny L, Shakhnovich E: Evolutionary conservation of the folding nucleus. J Mol Biol 2001, 308:123-129.
- [15]Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215:403-410.
- [16]Kikuchi T: Inter-Cα atomic potentials derived from the statistics of average interresidue distances in proteins: application to bovine pancreatic trypsin inhibitor. J Comput Chem 1996, 17:226-237.
- [17]Daggett V, Fersht AR: Transition states in protein folding. In Mechanisms of protein folding. 2nd edition. Edited by Pain RH. New York: Oxford University Press; 2000:175-211.
- [18]Koga N, Takada S: Roles of native topology and chain-length scaling in protein folding: a simulation study with a Gō-like model. J Mol Biol 2001, 313:171-180.
- [19]Chavez LL, Onuchic JN, Clementi C: Quantifying the roughness on the free energy landscape: entropic bottlenecks and protein folding rates. J Am Chem Soc 2004, 126:8426-8432.
- [20]Clementi C, Nymeyer H, Onuchic JN: Topological and energetic factors: what determines the structural details of the transition state ensemble and "en-route" intermediates for protein folding? An investigation for small globular proteins. J Mol Biol 2000, 298:937-953.
- [21]Karanicolas J, Brooks CL: Improved Gō-like models demonstrate the robustness of protein folding mechanisms towards non-native interactions. J Mol Biol 2003, 334:309-325.
- [22]Karanicolas J, Brooks CL: The origins of asymmetry in the folding transition states of protein L and protein G. Protein Sci 2002, 11:2351-2361.
- [23]Clementi C, García AE, Onuchic JN: Interplay among tertiary contacts, secondary structure formation and side-chain packing in the protein folding mechanism: all-atom representation study of protein L. J Mol Biol 2003, 326:933-954.
- [24]Luo Z, Ding J, Zhou Y: Folding mechanisms of individual beta-hairpins in a Gō model of Pin1 WW domain by all-atom molecular dynamics simulations. J Chem Phys 2008, 128:225103.
- [25]Prieto L, Rey A: Influence of the native topology on the folding barrier for small proteins. J Chem Phys 2007, 127:175101.
- [26]Dokholyan NV, Buldyrev SV, Stanley HE, Shakhnovich EI: Discrete molecular dynamics studies of the folding of a protein-like model. Fold Des 1998, 3:577-587.
- [27]Sugita M, Kikuchi T: Incorporating into a Cα Gō model the effects of geometrical restriction on Cα atoms caused by side chain orientations. Proteins Struct Funct Bioinf 2013, 81:1434-1445.
PDF