期刊论文详细信息
BMC Systems Biology
An effective method for refining predicted protein complexes based on protein activity and the mechanism of protein complex formation
Yi Pan2  Min Li1  Qianghua Xiao1  Xiaoqing Peng1  Jianxin Wang1 
[1] School of Information Science and Engineering, Central South University, Changsha 410083, China;Department of Computer Science, Georgia State University, Atlanta, GA 30302-4110, USA
关键词: Gene expression;    Refining;    Just-in-time;    Protein complex formation model;    Protein activity;   
Others  :  1142946
DOI  :  10.1186/1752-0509-7-28
 received in 2012-10-09, accepted in 2013-03-14,  发布年份 2013
PDF
【 摘 要 】

Background

Identifying protein complexes from protein-protein interaction network is fundamental for understanding the mechanism of cellular component and protein function. At present, many methods to identify protein complexes are mainly based on the topological characteristics or the functional similarity features, neglecting the fact that proteins must be in their active forms to interact with others and the formation of protein complex is following a just-in-time mechanism.

Results

This paper firstly presents a protein complex formation model based on the just-in-time mechanism. By investigating known protein complexes combined with gene expression data, we find that most protein complexes can be formed in continuous time points, and the average overlapping rate of the known complexes during the formation is large. A method is proposed to refine the protein complexes predicted by clustering algorithms based on the protein complex formation model and the properties of known protein complexes. After refinement, the number of known complexes that are matched by predicted complexes, Sensitivity, Specificity, and f-measure are significantly improved, when compared with those of the original predicted complexes.

Conclusion

The refining method can discard the spurious proteins by protein activity and generate new complexes by just-in-time assemble mechanism, which can enhance the ability to predict complex.

【 授权许可】

   
2013 Wang et al.; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20150324133721127.pdf 265KB PDF download
Figure 7. 59KB Image download
Figure 6. 53KB Image download
Figure 5. 41KB Image download
Fig. 6. 20KB Image download
Figure 3. 59KB Image download
Figure 2. 19KB Image download
Figure 1. 35KB Image download
【 图 表 】

Figure 1.

Figure 2.

Figure 3.

Fig. 6.

Figure 5.

Figure 6.

Figure 7.

【 参考文献 】
  • [1]Barabási A, Oltvai Z: Network biology: understanding the cell’s functional organization. Nat Rev Genet 2004, 5(2):101-113.
  • [2]Garrels J: Yeast genomic databases and the challenge of the post-genomic era. Funct & Integr genomics 2002, 2(4):212-237.
  • [3]Gavin A, Superti-Furga G, et al.: Protein complexes and proteome organization from yeast to man. Curr Opin Chem Biol 2003, 7:21.
  • [4]Li M, Chen J, Wang J, Hu B, Chen G: Modifying the DPClus algorithm for identifying protein complexes based on new topological structures. BMC bioinformatics 2008, 9:398. BioMed Central Full Text
  • [5]Amoutzias G, de Peer YV: Single-gene and whole-genome duplications and the evolution of protein-protein interaction networks. In Evolutionary Genomics and Systems Biology. Edited by Gustavo Caetano-Anollés. Hoboken: Wiley-Blackwell; 2010:413-429.
  • [6]Wang J, Li M, Deng Y, Pan Y: Recent advances in clustering methods for protein interaction networks. BMC genomics 2010, 11(Suppl 3):S10. BioMed Central Full Text
  • [7]Hartuv E, Shamir R: A clustering algorithm based on graph connectivity. Inf Process Lett 2000, 76(4):175-181.
  • [8]Girvan M, Newman ME: Community structure in social and biological networks. Proceedings of the National Academy of Sciences 2002, 99(12):7821-7826.
  • [9]Spirin V, Mirny L: Protein complexes and functional modules in molecular networks. Proceedings of the National Academy of Sciences 2003, 100(21):12123-12128.
  • [10]King A, Pržulj N, Jurisica I: Protein complex prediction via cost-based clustering. Bioinformatics 2004, 20(17):3013-3020.
  • [11]Radicchi F, Castellano C, Cecconi F, Loreto V, Parisi D: Defining and identifying communities in networks. Proceedings of the National Academy of Sciences of the United States of America 2004, 101(9):2658-2663.
  • [12]Newman M: Fast algorithm for detecting community structure in networks. Phys Rev E 2004, 69(6):066133.
  • [13]Veraksa A, Bauer A, Artavanis-Tsakonas S: Analyzing protein complexes in Drosophila with tandem affinity purification–mass spectrometry. Dev Dyn 2005, 232(3):827-834.
  • [14]Adamcsek B, Palla G, Farkas I, Derényi I, Vicsek T: CFinder: locating cliques and overlapping modules in biological networks. Bioinformatics 2006, 22(8):1021-1023.
  • [15]Li X, Tan S, Foo C, Ng S, et al.: Interaction graph mining for protein complexes using local clique merging. GENOME INFORM SER 2005, 16(2):260-269.
  • [16]Altaf-Ul-Amin M, Shinbo Y, Mihara K, Kurokawa K, Kanaya S: Development and implementation of an algorithm for detection of protein complexes in large interaction networks. BMC bioinformatics 2006, 7:207-219. BioMed Central Full Text
  • [17]Rungsarityotin W, Krause R, Schödl A, Schliep A: Identifying protein complexes directly from high-throughput TAP data with Markov random fields. BMC bioinformatics 2007, 8:482. BioMed Central Full Text
  • [18]Cho Y, Hwang W, Ramanathan M, Zhang A: Semantic integration to identify overlapping functional modules in protein interaction networks. BMC bioinformatics 2007, 8:265. BioMed Central Full Text
  • [19]Luo F, Yang Y, Chen C, Chang R, Zhou J, Scheuermann R: Modular organization of protein interaction networks. Bioinformatics 2007, 23(2):207-214.
  • [20]Zhang S, Wang R, Zhang X: Identification of overlapping community structure in complex networks using fuzzy c-means clustering. Physica A: Stat Mech Appl 2007, 374:483-490.
  • [21]Leung H, Xiang Q, Yiu S, Chin F: Predicting protein complexes from PPI data: a core-attachment approach. J Comput Biol 2009, 16(2):133-144.
  • [22]Liu G, Wong L, Chua H: Complex discovery from weighted PPI networks. Bioinformatics 2009, 25(15):1891-1897.
  • [23]Lancichinetti A, Fortunato S, Kertész J: Detecting the overlapping and hierarchical community structure in complex networks. New J Phys 2009, 11(3):033015.
  • [24]Wang J, Liu B, Li M, Pan Y: Identifying protein complexes from interaction networks based on clique percolation and distance restriction. BMC Genomics 2010, 11(Suppl 2):S10. BioMed Central Full Text
  • [25]Jiang P, Singh M: SPICi: a fast clustering algorithm for large biological networks. Bioinformatics 2010, 26(8):1105-1111.
  • [26]Wang J, Li M, Chen J, Pan Y: A fast hierarchical clustering algorithm for functional modules discovery in protein interaction networksDavid00. Comput Biol Bioinformatics, IEEE/ACM Transactions on 2011, 8(3):607-620.
  • [27]Kenley E, Cho Y: Entropy-based graph clustering: Application to biological and social networks. In Data Mining (ICDM), 2011 IEEE 11th International Conference on. : IEEE; 2011:1116-1121.
  • [28]Chen B, Shi J, Zhang S, Wu F: Identifying protein complexes in protein–protein interaction networks by using clique seeds and graph entropy. Proteomics 2013, 13(2):269-277.
  • [29]Palla G, Derényi I, Farkas I, Vicsek T: Uncovering the overlapping community structure of complex networks in nature and society. Nature 2005, 435(7043):814-818.
  • [30]Bader G, Hogue C: An automated method for finding molecular complexes in large protein interaction networks. BMC bioinformatics 2003, 4:2. BioMed Central Full Text
  • [31]Eisenberg D, Marcotte E, Xenarios I, Yeates T, et al.: Protein function in the post-genomic era. NATURE-LONDON- 2000, 405:823-826.
  • [32]Pržulj N, Wigle D, Jurisica I: Functional topology in a network of protein interactions. Bioinformatics 2004, 20(3):340-348.
  • [33]de Lichtenberg U, Jensen L, Brunak S, Bork P: Dynamic complex formation during the yeast cell cycle. Sci Signal 2005, 307(5710):724-727.
  • [34]Komurov K, White M: Revealing static and dynamic modular architecture of the eukaryotic protein interaction network. Mol Syst Biol 2007, 3:110.
  • [35]Petsko G, Ringe D: Protein structure and function. : Sinauer Associates Inc; 2004.
  • [36]Pu S, Wong J, Turner B, Cho E, Wodak S: Up-to-date catalogues of yeast protein complexes. Nucleic Acids Res 2009, 37(3):825-831.
  • [37]Xenarios I, Salwinski L, Duan X, Higney P, Kim S, Eisenberg D: DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions. Nucleic Acids Res 2002, 30:303-305.
  • [38]Tu B, Kudlicki A, Rowicka M, McKnight S: Logic of the yeast metabolic cycle: temporal compartmentalization of cellular processes. Science 2005, 310(5751):1152-1158.
  • [39]Wang J, Peng X, Li M, Pan Y: Construction and application of dynamic protein interaction network based on time course gene expression data. Proteomics 2013, 13(2):301-312.
  • [40]van Dongen S: Graph clustering by flow simulation. PhD thesis. University of Utrecht 2000
  • [41]Enright A, Van Dongen S, Ouzounis C: An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res 2002, 30(7):1575-1584.
  文献评价指标  
  下载次数:53次 浏览次数:15次