| Genes | |
| Clustering Rfam 10.1: Clans, Families, and Classes | |
| Felipe A. Lessa [2], Tainá Raiol [3], Marcelo M. Brigido [3], Daniele S. B. Martins Neto [1], Maria Emília M. T. Walter [2] | |
| [1] Department of Mathematics, University of Brasília, Brasília 70910-900, Brazil; [2] Department of Computer Science, Institute of Exact Sciences, University of Brasília, Brasília 70910-900, Brazil; [3] Department of Cellular Biology, Institute of Biology, University of Brasília, Brasília 70910-900, Brazil | |
| Keywords: Rfam; non-coding RNA; secondary structure; clans; clusters | |
| DOI : 10.3390/genes3030378 | |
| Source: MDPI | |
【 Abstract 】
The Rfam database contains information about non-coding RNAs, emphasizing their secondary structures and organizing them into families of homologous RNA genes or functional RNA elements. Recently, a higher-order organization of Rfam in terms of so-called clans was proposed along with its “decimal release”. In this proposal, some of the families have been assigned to clans based on experimental and computational data in order to find related families. In the present work, we investigate an alternative classification of the RNA families based on tree edit distance. The resulting clustering recovers some of the Rfam clans. The majority of clans, however, are not recovered by the structural clustering. Instead, they are dispersed into larger clusters, which correspond roughly to well-described RNA classes such as snoRNAs, miRNAs, and CRISPRs. In conclusion, a structure-based clustering can contribute to the elucidation of the relationships among the Rfam families beyond the realm of clans and classes.
【 License 】
CC BY
© 2012 by the authors; licensee MDPI, Basel, Switzerland.