期刊论文详细信息
BMC Bioinformatics
LaRA 2: parallel and vectorized program for sequence–structure alignment of RNA sequences
Elisa Ficarra1  Jörg Winkler2  Knut Reinert2  Gianvito Urgese3 
[1] Department of Control and Computer Science, Politecnico di Torino, C.so Duca degli Abruzzi 24, 10129, Turin, Italy;Department of Mathematics and Computer Science, Free University Berlin, Takustraße 9, 14195, Berlin, Germany;Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195, Berlin, Germany;Interuniversity Department of Regional and Urban Studies and Planning, Politecnico di Torino, C.so Duca degli Abruzzi 24, 10129, Turin, Italy;
关键词: RNA secondary structure;    Integer linear program;    Bioinformatics;    Structural alignment;    RNA;    Algorithms;    Parallel computing;   
DOI  :  10.1186/s12859-021-04532-7
来源: Springer
PDF
【 摘 要 】

BackgroundThe function of non-coding RNA sequences is largely determined by their spatial conformation, namely the secondary structure of the molecule, formed by Watson–Crick interactions between nucleotides. Hence, modern RNA alignment algorithms routinely take structural information into account. In order to discover yet unknown RNA families and infer their possible functions, the structural alignment of RNAs is an essential task. This task demands a lot of computational resources, especially for aligning many long sequences, and it therefore requires efficient algorithms that utilize modern hardware when available. A subset of the secondary structures contains overlapping interactions (called pseudoknots), which add additional complexity to the problem and are often ignored in available software.ResultsWe present the SeqAn-based software LaRA 2 that is significantly faster than comparable software for accurate pairwise and multiple alignments of structured RNA sequences. In contrast to other programs our approach can handle arbitrary pseudoknots. As an improved re-implementation of the LaRA tool for structural alignments, LaRA 2 uses multi-threading and vectorization for parallel execution and a new heuristic for computing a lower boundary of the solution. Our algorithmic improvements yield a program that is up to 130 times faster than the previous version.ConclusionsWith LaRA 2 we provide a tool to analyse large sets of RNA secondary structures in relatively short time, based on structural alignment. The produced alignments can be used to derive structural motifs for the search in genomic databases.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202203113876879ZK.pdf 2260KB PDF download
  文献评价指标  
  下载次数:13次 浏览次数:11次