BMC Bioinformatics | |
CORNAS: coverage-dependent RNA-Seq analysis of gene expression data without biological replicates | |
Research | |
Martti T. Tammi1  Joel Z. B. Low2  Tsung Fei Khang3  | |
[1] Institute of Biological Sciences, Faculty of Science, University of Malaya, 50603, Kuala Lumpur, Malaysia;Institute of Biological Sciences, Faculty of Science, University of Malaya, 50603, Kuala Lumpur, Malaysia;Sime Darby Technology Centre Sdn. Bhd., UPM-MTDC Technology Centre III, University Putra Malaysia, 43400, Serdang, Malaysia;Institute of Mathematical Sciences, Faculty of Science, University of Malaya, 50603, Kuala Lumpur, Malaysia;University of Malaya Centre for Data Analytics, University of Malaya, 50603, Kuala Lumpur, Malaysia; | |
关键词: RNA-Seq; Unreplicated experiments; Bayesian statistics; Differential gene expression; Sequencing coverage; Illumina; | |
DOI : 10.1186/s12859-017-1974-4 | |
来源: Springer | |
【 摘 要 】
BackgroundIn current statistical methods for calling differentially expressed genes in RNA-Seq experiments, the assumption is that an adjusted observed gene count represents an unknown true gene count. This adjustment usually consists of a normalization step to account for heterogeneous sample library sizes, and then the resulting normalized gene counts are used as input for parametric or non-parametric differential gene expression tests. A distribution of true gene counts, each with a different probability, can result in the same observed gene count. Importantly, sequencing coverage information is currently not explicitly incorporated into any of the statistical models used for RNA-Seq analysis.ResultsWe developed a fast Bayesian method which uses the sequencing coverage information determined from the concentration of an RNA sample to estimate the posterior distribution of a true gene count. Our method has better or comparable performance compared to NOISeq and GFOLD, according to the results from simulations and experiments with real unreplicated data. We incorporated a previously unused sequencing coverage parameter into a procedure for differential gene expression analysis with RNA-Seq data.ConclusionsOur results suggest that our method can be used to overcome analytical bottlenecks in experiments with limited number of replicates and low sequencing coverage. The method is implemented in CORNAS (Coverage-dependent RNA-Seq), and is available at https://github.com/joel-lzb/CORNAS.
【 授权许可】
CC BY
© The Author(s) 2017
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311109055358ZK.pdf | 1441KB | download | |
Fig. 5 | 3355KB | Image | download |
Fig. 4 | 796KB | Image | download |
Table 1 | 92KB | Table | download |
【 图 表 】
Fig. 4
Fig. 5
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]
- [35]
- [36]
- [37]
- [38]
- [39]
- [40]
- [41]
- [42]
- [43]
- [44]
- [45]
- [46]