期刊论文详细信息
Methods and Protocols
Combining Multiple RNA-Seq Data Analysis Algorithms Using Machine Learning Improves Differential Isoform Expression Analysis
Alexandros C. Dimopoulos1  Panagiotis Moulos1  Konstantinos Koukoutegos2  Fotis E. Psomopoulos2 
[1]Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center ‘Alexander Fleming’, Fleming 34, 16672 Vari, Greece
[2]Institute of Applied Biosciences, Centre for Research and Technology Hellas, 6th Km Charilaou-Thermis Rd, 57001 Thessaloniki, Greece
关键词: RNA-sequencing;    alternative splicing;    machine learning;    docker;    shiny;   
DOI  :  10.3390/mps4040068
来源: DOAJ
【 摘 要 】
RNA sequencing has become the standard technique for high resolution genome-wide monitoring of gene expression. As such, it often comprises the first step towards understanding complex molecular mechanisms driving various phenotypes, spanning organ development to disease genesis, monitoring and progression. An advantage of RNA sequencing is its ability to capture complex transcriptomic events such as alternative splicing which results in alternate isoform abundance. At the same time, this advantage remains algorithmically and computationally challenging, especially with the emergence of even higher resolution technologies such as single-cell RNA sequencing. Although several algorithms have been proposed for the effective detection of differential isoform expression from RNA-Seq data, no widely accepted golden standards have been established. This fact is further compounded by the significant differences in the output of different algorithms when applied on the same data. In addition, many of the proposed algorithms remain scarce and poorly maintained. Driven by these challenges, we developed a novel integrative approach that effectively combines the most widely used algorithms for differential transcript and isoform analysis using state-of-the-art machine learning techniques. We demonstrate its usability by applying it on simulated data based on several organisms, and using several performance metrics; we conclude that our strategy outperforms the application of the individual algorithms. Finally, our approach is implemented as an R Shiny application, with the underlying data analysis pipelines also available as docker containers.
【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次