BMC Bioinformatics | |
JED: a Java Essential Dynamics Program for comparative analysis of protein trajectories | |
Software | |
Charles C. David1  Ettayapuram Ramaprasad Azhagiya Singam2  Donald J. Jacobs3  | |
[1] Department of Bioinformatics and Genomics, University of North Carolina, Charlotte, USA;Current Address: The New Zealand Institute for Plant & Food Research, Limited, Lincoln, New Zealand;Department of Physics and Optical Science, University of North Carolina, Charlotte, USA;Department of Physics and Optical Science, University of North Carolina, Charlotte, USA;Center for Biomedical Engineering and Science, University of North Carolina, Charlotte, USA; | |
关键词: Essential dynamics; Principal component analysis; Distance pairs; Partial correlations; Vector space comparison; Principal angles; | |
DOI : 10.1186/s12859-017-1676-y | |
received in 2017-02-02, accepted in 2017-05-03, 发布年份 2017 | |
来源: Springer | |
【 摘 要 】
BackgroundEssential Dynamics (ED) is a common application of principal component analysis (PCA) to extract biologically relevant motions from atomic trajectories of proteins. Covariance and correlation based PCA are two common approaches to determine PCA modes (eigenvectors) and their eigenvalues. Protein dynamics can be characterized in terms of Cartesian coordinates or internal distance pairs. In understanding protein dynamics, a comparison of trajectories taken from a set of proteins for similarity assessment provides insight into conserved mechanisms. Comprehensive software is needed to facilitate comparative-analysis with user-friendly features that are rooted in best practices from multivariate statistics.ResultsWe developed a Java based Essential Dynamics toolkit called JED to compare the ED from multiple protein trajectories. Trajectories from different simulations and different proteins can be pooled for comparative studies. JED implements Cartesian-based coordinates (cPCA) and internal distance pair coordinates (dpPCA) as options to construct covariance (Q) or correlation (R) matrices. Statistical methods are implemented for treating outliers, benchmarking sampling adequacy, characterizing the precision of Q and R, and reporting partial correlations. JED output results as text files that include transformed coordinates for aligned structures, several metrics that quantify protein mobility, PCA modes with their eigenvalues, and displacement vector (DV) projections onto the top principal modes. Pymol scripts together with PDB files allow movies of individual Q- and R-cPCA modes to be visualized, and the essential dynamics occurring within user-selected time scales. Subspaces defined by the top eigenvectors are compared using several statistical metrics to quantify similarity/overlap of high dimensional vector spaces. Free energy landscapes can be generated for both cPCA and dpPCA.ConclusionsJED offers a convenient toolkit that encourages best practices in applying multivariate statistics methods to perform comparative studies of essential dynamics over multiple proteins. For each protein, Cartesian coordinates or internal distance pairs can be employed over the entire structure or user-selected parts to quantify similarity/differences in mobility and correlations in dynamics to develop insight into protein structure/function relationships.
【 授权许可】
CC BY
© The Author(s). 2017
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311104600794ZK.pdf | 2898KB | download | |
MediaObjects/13046_2023_2865_MOESM10_ESM.jpg | 226KB | Other | download |
Fig. 1 | 1445KB | Image | download |
Fig. 4 | 532KB | Image | download |
Fig. 2 | 1809KB | Image | download |
Fig. 3 | 251KB | Image | download |
Fig. 4 | 632KB | Image | download |
【 图 表 】
Fig. 4
Fig. 3
Fig. 2
Fig. 4
Fig. 1
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]