Proteome Science | |
Fully automated protein complex prediction based on topological similarity and community structure | |
Research | |
Chengwei Lei1  Jianhua Ruan1  Saleh Tamim1  Alexander JR Bishop2  | |
[1] Department of Computer Science, The University of Texas at San Antonio, 78249, San Antonio, TX, USA;Greehey Children's Cancer Research Institute, The University of Texas Health Science Center at San Antonio, 78229, San Antonio, TX, USA;Department of Cellular and Structural Biology, The University of Texas Health Science Center at San Antonio, 78229, San Antonio, TX, USA; | |
关键词: PPI network; random walk; protein-protein interaction; protein complex; clustering; | |
DOI : 10.1186/1477-5956-11-S1-S9 | |
来源: Springer | |
【 摘 要 】
To understand the function of protein complexes and their association with biological processes, a lot of studies have been done towards analyzing the protein-protein interaction (PPI) networks. However, the advancement in high-throughput technology has resulted in a humongous amount of data for analysis. Moreover, high level of noise, sparseness, and skewness in degree distribution of PPI networks limits the performance of many clustering algorithms and further analysis of their interactions.In addressing and solving these problems we present a novel random walk based algorithm that converts the incomplete and binary PPI network into a protein-protein topological similarity matrix (PP-TS matrix). We believe that if two proteins share some high-order topological similarities they are likely to be interacting with each other. Using the obtained PP-TS matrix, we constructed and used weighted networks to further study and analyze the interaction among proteins. Specifically, we applied a fully automated community structure finding algorithm (Auto-HQcut) on the obtained weighted network to cluster protein complexes. We then analyzed the protein complexes for significance in biological processes. To help visualize and analyze these protein complexes we also developed an interface that displays the resulting complexes as well as the characteristics associated with each complex.Applying our approach to a yeast protein-protein interaction network, we found that the predicted protein-protein interaction pairs with high topological similarities have more significant biological relevance than the original protein-protein interactions pairs. When we compared our PPI network reconstruction algorithm with other existing algorithms using gene ontology and gene co-expression, our algorithm produced the highest similarity scores. Also, our predicted protein complexes showed higher accuracy measure compared to the other protein complex predictions.
【 授权许可】
CC BY
© Lei et al; licensee BioMed Central Ltd. 2013
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311101807191ZK.pdf | 1137KB | download |
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]