期刊论文详细信息
BMC Bioinformatics
ShrinkBayes: a versatile R-package for analysis of count-based sequencing data in complex study designs
Henk MW Verheul3  Daoud Sie1  Tineke E Buffart3  Maarten Neerincx3  Mark A van de Wiel2 
[1]Department of Pathology, VU University medical center, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
[2]Department of Mathematics, VU University, De Boelelaan 1081a, 1081 HV Amsterdam, The Netherlands
[3]Department of Medical Oncology, VU University medical center, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
关键词: Bayesian analysis;    Sequencing;    Shrinkage;    Differential expression;   
Others  :  818651
DOI  :  10.1186/1471-2105-15-116
 received in 2013-09-24, accepted in 2014-04-11,  发布年份 2014
PDF
【 摘 要 】

Background

Complex designs are common in (observational) clinical studies. Sequencing data for such studies are produced more and more often, implying challenges for the analysis, such as excess of zeros, presence of random effects and multi-parameter inference. Moreover, when sample sizes are small, inference is likely to be too liberal when, in a Bayesian setting, applying a non-appropriate prior or to lack power when not carefully borrowing information across features.

Results

We show on microRNA sequencing data from a clinical cancer study how our software ShrinkBayes tackles the aforementioned challenges. In addition, we illustrate its comparatively good performance on multi-parameter inference for groups using a data-based simulation. Finally, in the small sample size setting, we demonstrate its high power and improved FDR estimation by use of Gaussian mixture priors that include a point mass.

Conclusion

ShrinkBayes is a versatile software package for the analysis of count-based sequencing data, which is particularly useful for studies with small sample sizes or complex designs.

【 授权许可】

   
2014 van de Wiel et al.; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20140711131106319.pdf 247KB PDF download
Figure 2. 58KB Image download
Figure 1. 47KB Image download
【 图 表 】

Figure 1.

Figure 2.

【 参考文献 】
  • [1]Soneson C, Delorenzi M: A comparison of methods for differential expression analysis of RNA-seq data. BMC Bioinformatics 2013, 14:91. BioMed Central Full Text
  • [2]Van de Wiel MA, Leday GGR, Pardo L, Rue H, van der Vaart AW, van Wieringen WN: Bayesian analysis of RNA sequencing data by estimating multiple shrinkage priors. Biostatistics 2012, 14:113-128.
  • [3]Rue H, Martino S, Chopin N: Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations (with discussion). J R Stat Soc B 2009, 71:319-392.
  • [4]Van de Wiel MA, de Menezes RX, Siebring-van Olst E, van Beusechem VW: Analysis of small-sample clinical genomics studies using multi-parameter shrinkage: application to high-throughput RNA interference screening. BMC Med Genom 2013, 6:1. BioMed Central Full Text
  • [5]Efron B: Large-scale Inference. Institute of Mathematical Statistics Monographs. Cambridge: Cambridge University Press; 2010.
  • [6]Ventrucci M, Scott EM, Cocchi D: Multiple testing on standardized mortality ratios: a Bayesian hierarchical model for FDR estimation. Biostatistics 2011, 12:51-67.
  • [7]Wetzels R, Grasman RPPP, Wagenmakers E-J: An encompassing prior generalization of the Savage-Dickey density ratio. Comp Stat Data Anal 2010, 54:2094-2102.
  • [8]Robinson MD, McCarthy DJ, Smyth GK: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010, 26:139-140.
  • [9]Anders S, Huber W: Differential expression analysis for sequence count data. Genome Biol 2010, 11:106. BioMed Central Full Text
  • [10]Si Y, Liu P: An optimal test with maximum average power while controlling FDR with application to RNA-seq data. Biometrics 2013, 69:594-605.
  文献评价指标  
  下载次数:35次 浏览次数:23次