BMC Bioinformatics | |
Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling | |
Software | |
Gabriella Sferra1  Marta Ponzi1  Federica Fratini1  Elisabetta Pizzi1  | |
[1] Dipartimento di Malattie Infettive, Parassitarie e Immunomediate, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161, Rome, Italy; | |
关键词: Phylogenetic profiling; Distance correlation; Protein-protein interaction; | |
DOI : 10.1186/s12859-017-1815-5 | |
received in 2017-05-18, accepted in 2017-08-29, 发布年份 2017 | |
来源: Springer | |
【 摘 要 】
BackgroundElaboration of powerful methods to predict functional and/or physical protein-protein interactions from genome sequence is one of the main tasks in the post-genomic era. Phylogenetic profiling allows the prediction of protein-protein interactions at a whole genome level in both Prokaryotes and Eukaryotes. For this reason it is considered one of the most promising methods.ResultsHere, we propose an improvement of phylogenetic profiling that enables handling of large genomic datasets and infer global protein-protein interactions. This method uses the distance correlation as a new measure of phylogenetic profile similarity. We constructed robust reference sets and developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation that makes it applicable to large genomic data. Using Saccharomyces cerevisiae and Escherichia coli genome datasets, we showed that Phylo-dCor outperforms phylogenetic profiling methods previously described based on the mutual information and Pearson’s correlation as measures of profile similarity.ConclusionsIn this work, we constructed and assessed robust reference sets and propose the distance correlation as a measure for comparing phylogenetic profiles. To make it applicable to large genomic data, we developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation. Two R scripts that can be run on a wide range of machines are available upon request.
【 授权许可】
CC BY
© The Author(s). 2017
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311095816136ZK.pdf | 734KB | download | |
12888_2017_1559_Article_IEq1.gif | 1KB | Image | download |
12864_2017_3487_Article_IEq41.gif | 1KB | Image | download |
12864_2017_3661_Article_IEq5.gif | 1KB | Image | download |
12864_2017_3898_Article_IEq2.gif | 1KB | Image | download |
12864_2017_3645_Article_IEq4.gif | 1KB | Image | download |
【 图 表 】
12864_2017_3645_Article_IEq4.gif
12864_2017_3898_Article_IEq2.gif
12864_2017_3661_Article_IEq5.gif
12864_2017_3487_Article_IEq41.gif
12888_2017_1559_Article_IEq1.gif
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]