Evolutionary Bioinformatics | |
Grouped False-Discovery Rate for Removing the Gene-set-Level Bias of RNA-seq | |
Tae Young Yang1  | |
关键词: FDRseq; gene-level bias; gene-set analysis; gene-set-level bias; grouped false-discovery rate; RNA-seq; | |
DOI : 10.4137/EBO.S13099 | |
学科分类:生物技术 | |
来源: Sage Journals | |
【 摘 要 】
In recent years, RNA-seq has become a very competitive alternative to microarrays. In RNA-seq experiments, the expected read count for a gene is proportional to its expression level multiplied by its transcript length. Even when two genes are expressed at the same level, differences in length will yield differing numbers of total reads. The characteristics of these RNA-seq experiments create a gene-level bias such that the proportion of significantly differentially expressed genes increases with the transcript length, whereas such bias is not present in microarray data. Gene-set analysis seeks to identify the gene sets that are enriched in the list of the identified significant genes. In the gene-set analysis of RNA-seq, the gene-level bias subsequently yields the gene-set-level bias that a gene set with genes of long length will be more likely to show up as enriched than will a gene set with genes of shorter length. Because gene expression is not related to its transcript length, any gene set containing long genes is not of biologically greater interest than gene sets with shorter genes. Accordingly the gene-set-level bias should be removed to accurately calculate the statistical significance of each gene-set enrichment in the RNA-seq.
【 授权许可】
Unknown
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201901211367480ZK.pdf | 694KB | download |