期刊论文详细信息
Journal of computational biology
Predicting Protein Functions Based on Differential Co-expression and Neighborhood Analysis
article
Jael Sanyanda Wekesa1  Yushi Luan3  Jun Meng1 
[1] School of Computer Science and Technology, Dalian University of Technology;School of Computing and Information Technology, Jomo Kenyatta University of Agriculture and Technology;School of Life Science and Biotechnology, Dalian University of Technology
关键词: differential co-expression;    function prediction;    gene expression profile;    protein;    protein interaction;   
DOI  :  10.1089/cmb.2019.0120
来源: Mary Ann Liebert, Inc. Publishers
PDF
【 摘 要 】

Proteins are polypeptides essential in biological processes. Protein physical interactions are complemented by other types of functional relationship data including genetic interactions, knowledge about co-expression, and evolutionary pathways. Existing algorithms integrate protein interaction and gene expression data to retrieve context-specific subnetworks composed of genes/proteins with known and unknown functions. However, most protein function prediction algorithms fail to exploit diverse intrinsic information in feature and label spaces. We develop a novel integrative method based on differential Co-expression analysis and Neighbor-voting algorithm for Protein Function Prediction, namely CNPFP. The method integrates heterogeneous data and exploits intrinsic and latent linkages via global iterative approach and genomic features. CNPFP performs three tasks: clustering, differential co-expression analysis, and predicts protein functions. Our aim is to identify yeast cell cycle-specific proteins linked to differentially expressed proteins in the protein–protein interaction network. To capture intrinsic information, CNPFP selects the most relevant feature subset based on global iterative neighbor-voting algorithm. We identify eight condition-specific modules. The most relevant subnetwork has 87 genes highly enriched with cyclin-dependent kinases, a protein kinase relevant for cell cycle regulation. We present comprehensive annotations for 3538 Saccharomyces cerevisiae proteins. Our method achieves an AUROC of 0.9862, accuracy of 0.9710, and F-score of 0.9691. From the results, we can summarize that exploiting intrinsic nature of protein relationships improves the quality of function prediction. Thus, the proposed method is useful in functional genomics studies.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO202108110003417ZK.pdf 813KB PDF download
  文献评价指标  
  下载次数:3次 浏览次数:0次