BMC Structural Biology | |
Quantification of the impact of PSI:Biology according to the annotations of the determined structures | |
William A McLaughlin1  Elchin S Julfayev1  Paul J DePietro1  | |
[1] Department of Basic Science, The Commonwealth Medical College, 525 Pine Street, Scranton, PA 18509, USA | |
关键词: Structure to function relationships; Scientific partnerships; Protein annotation; Protein annotations; Structural genomics; Protein Structure Initiative; | |
Others : 793749 DOI : 10.1186/1472-6807-13-24 |
|
received in 2013-06-14, accepted in 2013-10-14, 发布年份 2013 | |
![]() |
【 摘 要 】
Background
Protein Structure Initiative:Biology (PSI:Biology) is the third phase of PSI where protein structures are determined in high-throughput to characterize their biological functions. The transition to the third phase entailed the formation of PSI:Biology Partnerships which are composed of structural genomics centers and biomedical science laboratories. We present a method to examine the impact of protein structures determined under the auspices of PSI:Biology by measuring their rates of annotations. The mean numbers of annotations per structure and per residue are examined. These are designed to provide measures of the amount of structure to function connections that can be leveraged from each structure.
Results
One result is that PSI:Biology structures are found to have a higher rate of annotations than structures determined during the first two phases of PSI. A second result is that the subset of PSI:Biology structures determined through PSI:Biology Partnerships have a higher rate of annotations than those determined exclusive of those partnerships. Both results hold when the annotation rates are examined either at the level of the entire protein or for annotations that are known to fall at specific residues within the portion of the protein that has a determined structure.
Conclusions
We conclude that PSI:Biology determines structures that are estimated to have a higher degree of biomedical interest than those determined during the first two phases of PSI based on a broad array of biomedical annotations. For the PSI:Biology Partnerships, we see that there is an associated added value that represents part of the progress toward the goals of PSI:Biology. We interpret the added value to mean that team-based structural biology projects that utilize the expertise and technologies of structural genomics centers together with biological laboratories in the community are conducted in a synergistic manner. We show that the annotation rates can be used in conjunction with established metrics, i.e. the numbers of structures and impact of publication records, to monitor the progress of PSI:Biology towards its goals of examining structure to function connections of high biomedical relevance. The metric provides an objective means to quantify the overall impact of PSI:Biology as it uses biomedical annotations from external sources.
【 授权许可】
2013 DePietro et al.; licensee BioMed Central Ltd.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
20140705055024962.pdf | 459KB | ![]() |
|
Figure 4. | 90KB | Image | ![]() |
Figure 3. | 74KB | Image | ![]() |
Figure 2. | 95KB | Image | ![]() |
Figure 1. | 71KB | Image | ![]() |
【 图 表 】
Figure 1.
Figure 2.
Figure 3.
Figure 4.
【 参考文献 】
- [1]Montelione GT: The Protein Structure Initiative: achievements and visions for the future. F1000 Biol Rep 2012, 4:7.
- [2]Yee A, Pardee K, Christendat D, Savchenko A, Edwards AM, Arrowsmith CH: Structural proteomics: toward high-throughput structural biology as a tool in functional genomics. Acc Chem Res 2003, 36(3):183-189.
- [3]Carter P, Lee D, Orengo C: Target selection in structural genomics projects to increase knowledge of protein structure and function space. Adv Protein Chem Struct Biol 2008, 75:1-52.
- [4]Marsden RL, Orengo CA: Target selection for structural genomics: an overview. Methods Mol Biol 2008, 426:3-25.
- [5]Dessailly BH, Nair R, Jaroszewski L, Fajardo JE, Kouranov A, Lee D, Fiser A, Godzik A, Rost B, Orengo C: PSI-2: structural genomics to cover protein domain family space. Structure 2009, 17(6):869-881.
- [6]Gabanyi MJ, Adams PD, Arnold K, Bordoli L, Carter LG, Flippen-Andersen J, Gifford L, Haas J, Kouranov A, McLaughlin WA: The structural biology knowledgebase: a portal to protein structures, sequences, functions, and methods. J Struct Funct Genomics 2011, 12(2):45-54.
- [7]Arnold K, Kiefer F, Kopp J, Battey JN, Podvinec M, Westbrook JD, Berman HM, Bordoli L, Schwede T: The protein model portal. J Struct Funct Genomics 2009, 10(1):1-8.
- [8]Biasini M, Mariani V, Haas J, Scheuber S, Schenk AD, Schwede T, Philippsen A: OpenStructure: a flexible software framework for computational structural biology. Bioinformatics 2010, 26(20):2626-2628.
- [9]Nair R, Liu J, Soong TT, Acton TB, Everett JK, Kouranov A, Fiser A, Godzik A, Jaroszewski L, Orengo C, et al.: Structural genomics is the largest contributor of novel structural leverage. J Struct Funct Genomics 2009, 10(2):181-191.
- [10]Cormier CY, Park JG, Fiacco M, Steel J, Hunter P, Kramer J, Singla R, LaBaer J: PSI: Biology-materials repository: a biologist’s resource for protein expression plasmids. J Struct Funct Genomics 2011, 12(2):55-62.
- [11]Duan L, Jaroszewski L, Minor W, Fratczak Z, Grabowski M, Godzik A: PSI impact: Ex-cited Use of PSI structures. In PSI:Nature Structural Biology Knowledgebase Research Advances. Nature Publishing Group; 2012. http://sbkb.org/update/research/psi-impact-ex-cited-use-of-psi-structures webcite
- [12]PSI metrics. http://sbkb.org/metrics/ webcite
- [13]Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, et al.: The Universal Protein Resource (UniProt). Nucleic Acids Res 2005, 33(Database issue):D154-159.
- [14]Sayers EW, Barrett T, Benson DA, Bolton E, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Federhen S: Database resources of the national center for biotechnology information. Nucleic Acids Res 2011, 39(Database issue):D38-51. http://nar.oxfordjournals.org/content/39/suppl_1/D38.long webcite
- [15]Kouranov A, Xie L, de la Cruz J, Chen L, Westbrook J, Bourne PE, Berman HM: The RCSB PDB information portal for structural genomics. Nucleic Acids Res 2006, 34(Database issue):D302-305.
- [16]Orphanet. http://www.orpha.net/consor/cgi-bin/index.php webcite
- [17]Karanicolas J, Corn JE, Chen I, Joachimiak LA, Dym O, Peck SH, Albeck S, Unger T, Hu W, Liu G: A de novo protein binding pair by computational design and directed evolution. Mol Cell 2011, 42(2):250-260.
- [18]Schaefer CF, Anthony K, Krupa S, Buchoff J, Day M, Hannay T, Buetow KH: PID: the Pathway Interaction Database. Nucleic Acids Res 2009, 37(Database issue):D674-679.
- [19]Brett D, Pospisil H, Valcárcel J, Reich J, Bork P: Alternative splicing and genome complexity. Nat Genet 2002, 30(1):29.
- [20]Gerlt JA, Allen KN, Almo SC, Armstrong RN, Babbitt PC, Cronan JE, Dunaway-Mariano D, Imker HJ, Jacobson MP, Minor W, et al.: The enzyme function initiative. Biochemistry 2011, 50(46):9950-9962.
- [21]Magrane M, Consortium U: UniProt knowledgebase: a hub of integrated protein data. Database (Oxford) 2011, 2011:bar009. http://europepmc.org/articles/PMC3070428 webcite
- [22]Pagliarini DJ, Calvo SE, Chang B, Sheth SA, Vafai SB, Ong S-E, Walford GA, Sugiana C, Boneh A, Chen WK: A mitochondrial protein compendium elucidates complex I disease biology. Cell 2008, 134(1):112-123.
- [23]Romero PR, Zaidi S, Fang YY, Uversky VN, Radivojac P, Oldfield CJ, Cortese MS, Sickmeier M, LeGall T, Obradovic Z: Alternative splicing in concert with protein intrinsic disorder enables increased functional diversity in multicellular organisms. Proc Natl Acad Sci 2006, 103(22):8390-8395.
- [24]Huang YJ, Hang D, Lu LJ, Tong L, Gerstein MB, Montelione GT: Targeting the human cancer pathway protein interaction network by structural genomics. Mol Cell Proteomics 2008, 7(10):2048-2060.
- [25]PSI:Biology impact study. http://protein.tcmedc.org/Charts/impact.jsp webcite
- [26]Karp PD, Ouzounis CA, Moore-Kochlacs C, Goldovsky L, Kaipa P, Ahren D, Tsoka S, Darzentas N, Kunin V, Lopez-Bigas N: Expansion of the BioCyc collection of pathway/genome databases to 160 genomes. Nucleic Acids Res 2005, 33(19):6083-6089.
- [27]Fukuda K: INOH pathway database: curation, annotation, integration. InterOntology08 2008, 1(1):47-50.
- [28]Degtyarenko K, de Matos P, Ennis M, Hastings J, Zbinden M, McNaught A, Alcantara R, Darsow M, Guedj M, Ashburner M: ChEBI: a database and ontology for chemical entities of biological interest. Nucleic Acids Res 2008, 36(Database issue):D344-350.
- [29]Dwinell MR, Worthey EA, Shimoyama M, Bakir-Gungor B, DePons J, Laulederkind S, Lowry T, Nigram R, Petri V, Smith J: The Rat genome database 2009: variation, ontologies and pathways. Nucleic Acids Res 2009, 37(suppl 1):D744-D749.
- [30]Blake JA, Bult CJ, Kadin JA, Richardson JE, Eppig JT: The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics. Nucleic Acids Res 2011, 39(suppl 1):D842-D848.
- [31]McKusick VA: Mendelian inheritance in Man and its online version, OMIM. Am J Hum Genet 2007, 80(4):588-604.
- [32]Dimmer EC, Huntley RP, Alam-Faruque Y, Sawford T, O’Donovan C, Martin MJ, Bely B, Browne P, Chan WM, Eberhardt R: The UniProt-GO annotation database in 2011. Nucleic Acids Res 2012, 40(D1):D565-D570.
- [33]Velankar S, Dana JM, Jacobsen J, van Ginkel G, Gane PJ, Luo J, Oldfield TJ, O’Donovan C, Martin M-J, Kleywegt GJ: SIFTS: structure integration with function, taxonomy and sequences resource. Nucleic Acids Res 2013, 41(D1):D483-D489.
- [34]Chen L, Oughtred R, Berman HM, Westbrook J: TargetDB: a target registration database for structural genomics projects. Bioinformatics 2004, 20(16):2860-2862.
- [35]TargetTrack. http://sbkb.org/tt/ webcite