| BMC Bioinformatics | |
| dConsensus: a tool for displaying domain assignments by multiple structure-based algorithms and for construction of a consensus assignment | |
| Kieran Alden2  Stella Veretnik3  Philip E Bourne1  | |
| [1] Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9500 Gilman Dr., La Jolla, CA 92093-0636, USA | |
| [2] York Centre for Complex Systems Analysis (YCCSA), University of York, Heslington, York, YO10 5DD, UK | |
| [3] San Diego Supercomputer Center, University of California San Diego, 9500 Gilman Dr., La Jolla, CA 92093-0743, USA | |
| Others : 1165268 DOI : 10.1186/1471-2105-11-310 |
|
| received in 2010-01-14, accepted in 2010-06-09, 发布年份 2010 | |
PDF
|
|
【 摘 要 】
Background
Partitioning of a protein into structural components, known as domains, is an important initial step in protein classification and for functional and evolutionary studies. While the systematic assignments of domains by human experts exist (CATH and SCOP), the introduction of high throughput technologies for structure determination threatens to overwhelm expert approaches. A variety of algorithmic methods have been developed to expedite this process, allowing almost instant structural decomposition into domains. The performance of algorithmic methods can approach 85% agreement on the number of domains with the consensus reached by experts. However, each algorithm takes a somewhat different conceptual approach, each with unique strengths and weaknesses. Currently there is no simple way to automatically compare assignments from different structure-based domain assignment methods, thereby providing a comprehensive understanding of possible structure partitioning as well as providing some insight into the tendencies of particular algorithms. Most importantly, a consensus assignment drawn from multiple assignment methods can provide a singular and presumably more accurate view.
Results
We introduce dConsensus http://pdomains.sdsc.edu/dConsensus webcite; a web resource that displays the results of calculations from multiple algorithmic methods and generates a domain assignment consensus with an associated reliability score. Domain assignments from seven structure-based algorithms - PDP, PUU, DomainParser2, NCBI method, DHcL, DDomains and Dodis are available for analysis and comparison alongside assignments made by expert methods. The assignments are available for all protein chains in the Protein Data Bank (PDB). A consensus domain assignment is built by either allowing each algorithm to contribute equally (simple approach) or by weighting the contribution of each method by its prior performance and observed tendencies. An analysis of secondary structure around domain and fragment boundaries is also available for display and further analysis.
Conclusion
dConsensus provides a comprehensive assignment of protein domains. For the first time, seven algorithmic methods are brought together with no need to access each method separately via a webserver or local copy of the software. This aggregation permits a consensus domain assignment to be computed. Comparison viewing of the consensus and choice methods provides the user with insights into the fundamental units of protein structure so important to the study of evolutionary and functional relationships.
【 授权许可】
2010 Alden et al; licensee BioMed Central Ltd.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 20150416025425299.pdf | 1391KB | ||
| Figure 2. | 49KB | Image | |
| Figure 1. | 114KB | Image |
【 图 表 】
Figure 1.
Figure 2.
【 参考文献 】
- [1]Wetlaufer DB: Nucleation, rapid folding, and globular intrachain regions in proteins. Proc Natl Acad Sci USA 1973, 70:697-701.
- [2]Rossman MG, Liljas A: Letter: Recognition of structural domains in globular proteins. J Mol Biol 1974, 85:177-181.
- [3]Veretnik S, Shindyalov IN: Computational methods for domain partitioning in protein structures. In Computational Methods for Protein Structure Prediction and Modeling. Edited by Xu Y, Xu D, Liang J. Springer; 2006:125-145.
- [4]Greene LH, Lewis TE, Addou S, Cuff A, Dallman T, Dibley M, Redfern O, Pearl F, Nambudiry R, Reid A, Sillitoe I, Yeats C, Thornton JM, Orengo CA: The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution. Nucleic Acids Res 2007, 35:D291-297.
- [5]Andreeva A, Howorth D, Brenner SE, Hubbard TJP, Chothia C, Murzin AG: SCOP database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Res 2004, 32:D226-229.
- [6]Veretnik S, Gu J, Wodak S: Identifying Structural Domains in Proteins. In In Genny Gu and Philip Bourne Structural Bioinformatics. Second edition. Wiley-Blackwell; 2009:485-513.
- [7]Veretnik S, Bourne PE, Alexandrov NN, Shindyalov IN: Toward consistent assignment of structural domains in proteins. J Mol Biol 2004, 339:647-678.
- [8]Holland TA, Veretnik S, Shindyalov IN, Bourne PE: Partitioning protein structures into domains: why is it so difficult? J Mol Biol 2006, 361:562-590.
- [9]Jones S, Stewart M, Michie A, Swindells MB, Orengo C, Thornton JM: Domain assignment for protein structures using a consensus approach: characterization and analysis. Protein Sci 1998, 7:233-242.
- [10]Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, Thornton JM: CATH--a hierarchic classification of protein domain structures. Structure 1997, 5:1093-1108.
- [11]Murzin AG, Brenner SE, Hubbard T, Chothia C: SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 1995, 247:536-540.
- [12]Alexandrov N, Shindyalov I: PDP: protein domain parser. Bioinformatics 2003, 19:429-430.
- [13]Guo J, Xu D, Kim D, Xu Y: Improving the performance of DomainParser for structural domain partition using neural network. Nucleic Acids Res 2003, 31:944-952.
- [14]Holm L, Sander C: Parser for protein folding units. Proteins 1994, 19:256-268.
- [15]Zhou H, Xue B, Zhou Y: DDOMAIN: Dividing structures into domains using a normalized domain-domain interaction profile. Protein Sci 2007, 16:947-955.
- [16]Madej T, Gibrat JF, Bryant SH: Threading a database of protein cores. Proteins 1995, 23:356-369.
- [17]Koczyk G, Berezovsky IN: Domain Hierarchy and closed Loops (DHcL): a server for exploring hierarchy of protein domain structure. Nucleic Acids Res 2008, 36:W239-245.
- [18]Carugo O: Identification of domains in protein crystal structures. Journal of Applied Crystallography 2007, 40:778-781.
- [19]Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res 2000, 28:235-242.
PDF