BMC Bioinformatics
Automatic generation of pseudoknotted RNAs taxonomy
Research
Luca Tesei [1], Emanuela Merelli [1], Michela Quadrini [1]
[1] School of Sciences and Technology, University of Camerino, Via Madonna delle Carceri 7, 62032, Camerino, MC, Italy
Keywords: RNA secondary structures; Evaluation framework; Benchmark; RNA comparison methods; Agglomerative clustering
DOI: 10.1186/s12859-023-05362-5
Received 2022-02-18, accepted 2023-05-25, published 2023
Source: Springer
【 Abstract 】
Background: The ability to compare RNA secondary structures is important for understanding their biological function and for grouping similar organisms into families by looking at evolutionarily conserved sequences such as 16S rRNA. Most comparison methods and benchmarks in the literature focus on pseudoknot-free structures because pseudoknots are difficult to map onto classical tree representations. Some approaches can cluster pseudoknotted RNAs, but there is no general framework for evaluating their performance.

Results: We introduce an evaluation framework based on a similarity/dissimilarity measure obtained by a comparison method and on agglomerative clustering. Their combination automatically partitions a set of molecules into groups. To illustrate the framework, we define and make available a benchmark of pseudoknotted (16S and 23S) and pseudoknot-free (5S) rRNA secondary structures belonging to Archaea, Bacteria and Eukaryota. We also consider five different comparison methods from the literature that are able to handle pseudoknots. For each method, we cluster the molecules in the benchmark to obtain the taxa at the phylum rank according to the curated taxonomy of the European Nucleotide Archive. We compute appropriate metrics for each method and compare the methods' suitability for reconstructing the taxa.
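The pipeline described in the Results (a pairwise dissimilarity measure, agglomerative clustering, then a score against a reference taxonomy) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the article: the toy matrix `DIST`, the labels `PHYLA`, the choice of average linkage, and the use of the Rand index as the agreement metric. The abstract does not state which linkage criterion or which metrics the authors use.

```python
from itertools import combinations


def agglomerative(dist, n_clusters, linkage="average"):
    """Merge the two closest clusters until n_clusters remain.

    dist is a symmetric list-of-lists matrix of pairwise dissimilarities,
    e.g. as produced by any RNA structure comparison method.
    """
    clusters = [[i] for i in range(len(dist))]

    def cluster_distance(a, b):
        pairwise = [dist[i][j] for i in a for j in b]
        if linkage == "single":
            return min(pairwise)
        if linkage == "complete":
            return max(pairwise)
        return sum(pairwise) / len(pairwise)  # average linkage (assumed default)

    while len(clusters) > n_clusters:
        # Pick the pair of clusters with the smallest linkage distance.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: cluster_distance(clusters[p[0]], clusters[p[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # b > a, so index a stays valid
    return [sorted(c) for c in clusters]


def rand_index(clusters, labels):
    """Fraction of molecule pairs on which clusters and reference labels agree."""
    assigned = {i: k for k, members in enumerate(clusters) for i in members}
    pairs = list(combinations(range(len(labels)), 2))
    agree = sum(
        (assigned[i] == assigned[j]) == (labels[i] == labels[j]) for i, j in pairs
    )
    return agree / len(pairs)


# Hypothetical toy data: five molecules, two reference phyla.
DIST = [
    [0.00, 0.10, 0.20, 0.90, 0.80],
    [0.10, 0.00, 0.15, 0.85, 0.90],
    [0.20, 0.15, 0.00, 0.80, 0.95],
    [0.90, 0.85, 0.80, 0.00, 0.10],
    [0.80, 0.90, 0.95, 0.10, 0.00],
]
PHYLA = ["A", "A", "A", "B", "B"]

clusters = agglomerative(DIST, n_clusters=len(set(PHYLA)))
score = rand_index(clusters, PHYLA)
print(clusters, score)
```

In this sketch the number of clusters is set to the number of distinct reference taxa, so the score reflects how well a given dissimilarity measure lets the clustering recover those groups. Swapping in a different matrix for `DIST` would compare a different comparison method under the same procedure.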
【 License 】
CC BY
© The Author(s) 2023