| Viruses | |
| Clustering of Giant Virus-DNA Based on Variations in Local Entropy | |
| Ranjan Bose [1], Gerhard Thiel [2] | |
| [1] Department of Electrical Engineering, IIT Delhi, Hauz Khas, New Delhi 110016, India; [2] Department of Biology, Technische Universität Darmstadt, 64287 Darmstadt, Germany | |
| Keywords: information theory; genomic sequences; evolution; phylogeny; virus | |
| DOI : 10.3390/v6062259 | |
| Source: MDPI | |
【 Abstract 】
We present a method for clustering genomic sequences based on variations in local entropy. We analyzed the distributions of block entropies in virus and plant genomes and observed a distinct pattern for each group. These distributions, which describe the local entropic variability of the genomes, are used to cluster the genomes according to their Jensen-Shannon (JS) distances. Analysis of the JS distances between all genomes of viruses that infect chlorella algae shows the host specificity of the viruses. We illustrate the efficacy of this entropy-based clustering technique by segregating plant and virus genomes into separate bins.
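The pipeline described in the abstract (block entropies, their distribution per genome, then a JS distance between distributions) can be sketched roughly as follows. This is only an illustration, not the authors' implementation: the abstract gives no block size, binning, or entropy alphabet, so `block_size`, `bins`, `max_entropy`, the use of single-nucleotide frequencies, and all function names are assumptions. Taking the JS distance as the square root of the base-2 JS divergence is likewise an assumed convention.

```python
import math
from collections import Counter


def block_entropies(seq, block_size=100):
    """Shannon entropy (bits) of symbol frequencies in each
    non-overlapping block; block_size is an assumed parameter."""
    entropies = []
    for start in range(0, len(seq) - block_size + 1, block_size):
        counts = Counter(seq[start:start + block_size])
        h = -sum((c / block_size) * math.log2(c / block_size)
                 for c in counts.values())
        entropies.append(h)
    return entropies


def entropy_histogram(entropies, bins=20, max_entropy=2.0):
    """Normalized histogram of block entropies over [0, max_entropy].
    max_entropy=2.0 assumes a four-letter alphabet (A, C, G, T)."""
    if not entropies:
        raise ValueError("sequence is shorter than one block")
    hist = [0] * bins
    for h in entropies:
        # Clamp so that h == max_entropy (or above) falls in the last bin.
        idx = min(int(h / max_entropy * bins), bins - 1)
        hist[idx] += 1
    total = sum(hist)
    return [count / total for count in hist]


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two distributions."""
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(x, y):
        # Terms with x_i == 0 contribute nothing to the sum.
        return sum(a * math.log2(a / b) for a, b in zip(x, y) if a > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def js_distance(p, q):
    """Square root of the JS divergence (assumed convention);
    max() guards against tiny negative rounding errors."""
    return math.sqrt(max(0.0, js_divergence(p, q)))
```

With a histogram computed for each genome, the pairwise `js_distance` values form a distance matrix that any standard clustering routine could consume. Which clustering procedure the authors apply is not stated in the abstract.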
【 License 】
CC BY
© 2014 by the authors; licensee MDPI, Basel, Switzerland.