| BioData Mining | |
| Graph representation of high-dimensional alpha-helical membrane protein data | |
| Steffen Grunert1  Dirk Labudde1  | |
| [1] Department of Mathematics, Sciences, Computer Science, University of Applied Sciences Mittweida, Technikumplatz 17, Mittweida 09648, Germany | |
| 关键词: Architecture; Graph; Motifs; Membrane proteins; | |
| Others : 797144 DOI : 10.1186/1756-0381-6-21 |
|
| received in 2013-08-07, accepted in 2013-11-26, 发布年份 2013 | |
PDF
|
|
【 摘 要 】
Background
In genomics and proteomics, membrane protein analysis have shown that such analyses are very important to support the understanding of complex biological processes. In Genome-wide investigations of membrane proteins a large number of short, distinct sequence motifs has been revealed. Such motifs found so far support the understanding of the folded membrane protein in the membrane environment. They provide important information about functional or stabilizing properties. Recently several integrative approaches have been proposed to extract meaningful information out of the membrane environment. However, many information based approaches deliver results having deficits of visualisation outputs. Outgoing from high-throughput protein data analysis, these outputs play an important role in the evaluation of high-dimensional protein data, to establish a biological relationship and ultimately to provide useful information for research.
Results
We have evaluated different resulting graphs generated from statistical analysis of consecutive motifs in helical structures of the membrane environment. Our results show that representative motifs with high occurrence in all investigated protein families are responsible for the general importance in alpha-helical membrane structure formation. Further, motifs which often occur with others in their function as so called “hubs” lead to the assumption, that these motifs constitute as important components in helical structures within the membrane. Otherwise, consecutive motifs and hubs which show a high occurrence in certain families only can be classified as important for family-specific functional characteristics. Summarized, we are able to bridge our graphical results from high-throughput analysis of membrane proteins over networking with databases to a biological context.
Conclusions
Our results and the corresponding graphical visualisation support the understanding and interpretation of structure forming and functional motifs of membrane proteins. Our results are useful to interpret and refine results of common developed approaches. At last we show a simple way to visualise high-dimensional protein data in context to biological relevant information.
【 授权许可】
2013 Grunert and Labudde; licensee BioMed Central Ltd.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 20140706040840330.pdf | 3199KB | ||
| Figure 11. | 46KB | Image | |
| Figure 10. | 38KB | Image | |
| Figure 9. | 34KB | Image | |
| Figure 8. | 75KB | Image | |
| Figure 7. | 70KB | Image | |
| Figure 6. | 135KB | Image | |
| Figure 5. | 84KB | Image | |
| Figure 4. | 43KB | Image | |
| Figure 3. | 82KB | Image | |
| Figure 2. | 53KB | Image | |
| Figure 1. | 114KB | Image |
【 图 表 】
Figure 1.
Figure 2.
Figure 3.
Figure 4.
Figure 5.
Figure 6.
Figure 7.
Figure 8.
Figure 9.
Figure 10.
Figure 11.
【 参考文献 】
- [1]Eisenberg D, Marcotte EM, Xenarios I, Yeates TO: Protein function in the post-genomic era. Nature 2000, 405(6788):823-826.
- [2]Luckey M: Membrane Structural Biology.. Cambridge University Press; 2008.
- [3]Singer SJ, Nicolson GL, et al.: The fluid mosaic model of the structure of cell membranes. Science 1972, 175(23):720-731.
- [4]Venkatakrishnan A, Deupi X, Lebon G, Tate CG, Schertler GF, Babu MM: Molecular signatures of g-protein-coupled receptors. Nature 2013, 494(7436):185-194.
- [5]Lan N, Montelione GT, Gerstein M: Ontologies for proteomics: towards a systematic definition of structure and function that scales to the genome level. Curr Opin Chem Biol 2003, 7(1):44-54.
- [6]Marsico A, Labudde D, Sapra T, Muller DJ, Schroeder M: A novel pattern recognition algorithm to classify membrane protein unfolding pathways with high-throughput single-molecule force spectroscopy. Bioinformatics 2007, 23(2):231-236.
- [7]Childers M, Eckel G, Himmel A, Caldwell J: A new model of cystic fibrosis pathology: lack of transport of glutathione and its thiocyanate conjugates. Med Hypotheses 2007, 68(1):101-112.
- [8]Rowe SM, Miller S, Sorscher EJ: Cystic fibrosis. N Engl J Med 2005, 352(19):1992-2001.
- [9]Liu Y, Engelman DM, Gerstein M, et al.: Genomic analysis of membrane protein families: abundance and conserved motifs. Genome Biol 2002, 3(10):1-0054.
- [10]Arkin IT, et al.: Statistical analysis of predicted transmembrane α-helices. Biochimica et Biophysica Acta (BBA)-Protein Struct Mol Enzymol 1998, 1429(1):113-128.
- [11]Senes A, Gerstein M, Engelman D M, et al.: Statistical analysis of amino acid patterns in transmembrane helices: The gxxxg motif occurs frequently, and in association with beta-branched residues at neighboring positions. J Mol Biol 2000, 296(3):921-936.
- [12]Russ WP, Engelman D M, et al.: The gxxxg motif: a framework for transmembrane helix-helix association. J Mol Biol 2000, 296(3):911-919.
- [13]Senes A, Engel DE, DeGrado WF: Folding of helical membrane proteins: the role of polar, gxxxg-like and proline motifs. Curr Opin Struct Biol 2004, 14(4):465-479.
- [14]Grunert S, Heinke F, Labudde D: Structure topology prediction of discriminative sequence motifs in membrane proteins with domains of unknown functions. Struct Biol 2013, 2013:10.
- [15]Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, Heger A, Holm L, Sonnhammer ELL, Eddy SR, Bateman A, Finn RD: The pfam protein families database. Nucleic Acids Res 2012, 40(Database issue):290-301. http://dx.doi.org/10.1093/nar/gkr1065 webcite
- [16]Li W, Godzik A: Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22(13):1658-1659.
- [17]Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. Mol Biol 1990, 215(3):403-410.
- [18]Sonnhammer EL, von Heijne G, Krogh A: A hidden markov model for predicting transmembrane helices in protein sequences. Proc Int Conf Intell Syst Mol Biol 1998, 6:175-182.
- [19]Schiffer M, Edmundson AB: Use of helical wheels to represent the structures of proteins and to identify segments with helical potential. Biophys J 1967, 7:121-135.
- [20]Schuster-Böckler B, Schultz J, Rahman S: Hmm logos for visualization of protein families. 2004. http://dx.doi.org/10.1186/1471-2105-5-7 webcite
- [21]Sigrist CJ, de Castro E, Cerutti L, Cuche BA, Hulo N, Bridge A, Bougueleret L, Xenarios I: New and continuing developments at prosite. Nucleic Acids Res 2013, 41(D1):344-347.
- [22]Sigrist CJ, Cerutti L, Hulo N, Gattiker A, Falquet L, Pagni M, Bairoch A, Bucher P: Prosite: a documented database using patterns and profiles as motif descriptors. Brief Bioinform 2002, 3(3):265-274.
- [23]de Castro E, Sigrist CJ, Gattiker A, Bulliard V, Langendijk-Genevaux PS, Gasteiger E, Bairoch A, Hulo N: Scanprosite: detection of prosite signature matches and prorule-associated functional and structural residues in proteins. Nucleic Acids Res 2006, 34(suppl 2):362-365.
- [24]Sigrist CJ, De Castro E, Langendijk-Genevaux PS, Le Saux V, Bairoch A, Hulo N: Prorule: a new database containing functional and structural information on prosite profiles. Bioinformatics 2005, 21(21):4060-4066.
PDF