Journal Article Details
BMC Bioinformatics
Sources of variation in false discovery rate estimation include sample size, correlation, and inherent differences between groups
Research
Jiexin Zhang [1], Kevin R Coombes [1]
[1] Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 77030, Houston, Texas, USA;
Keywords: False Discovery Rate; Block Size; Dispersion Variate; Large Block Size; Affymetrix U133A
DOI: 10.1186/1471-2105-13-S13-S1
Source: Springer
【 Abstract 】

Background: High-throughput technologies enable tens of thousands of measurements to be tested simultaneously. Identifying genes that are differentially expressed or associated with clinical outcomes raises the multiple testing problem. False Discovery Rate (FDR) control is a statistical method used to correct for multiple comparisons when the test statistics are independent or weakly dependent. Although FDR control is frequently applied to microarray data analysis, gene expression is usually correlated, which might lead to inaccurate estimates. In this paper, we evaluate the accuracy of FDR estimation.

Methods: Using two real data sets, we resampled subgroups of patients and recalculated statistics of interest to illustrate the imprecision of FDR estimation. Next, we generated many simulated data sets with block correlation structures and realistic noise parameters, using the Ultimate Microarray Prediction, Inference, and Reality Engine (UMPIRE) R package. We estimated FDR using a beta-uniform mixture (BUM) model, and examined the variation in FDR estimation.

Results: The three major sources of variation in FDR estimation are the sample size, correlations among genes, and the true proportion of differentially expressed genes (DEGs). The sample size and proportion of DEGs affect both the magnitude and the precision of FDR estimation, while the correlation structure mainly affects the variation of the estimated parameters.

Conclusions: We have decomposed various factors that affect FDR estimation, and illustrated the direction and extent of their impact. We found that the proportion of DEGs has a significant impact on FDR; this factor might have been overlooked in previous studies and deserves more thought when controlling FDR.
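
As an illustration of the BUM-based FDR estimate mentioned in the abstract, the following is a minimal sketch rather than the authors' implementation. It assumes the commonly used parameterization of the beta-uniform mixture, f(p) = λ + (1 − λ)·a·p^(a−1) with 0 < λ < 1 and 0 < a < 1, an upper bound λ + (1 − λ)·a on the proportion of true nulls, and the estimate FDR(τ) = (λ + (1 − λ)·a)·τ / (λ·τ + (1 − λ)·τ^a) at p-value cutoff τ. The function names, the grid-search fit, and the simulated p-values are all hypothetical; the abstract does not state how the model was fitted, and the paper's own simulations use the UMPIRE R package rather than anything shown here.

```python
import math
import random


def simulate_pvalues(n, lam, a, seed=0):
    """Draw n p-values from a BUM density (illustrative data only)."""
    rng = random.Random(seed)
    pvals = []
    for _ in range(n):
        u = rng.random()
        # With probability lam the p-value is uniform; otherwise it is
        # Beta(a, 1), sampled by inverting the CDF p**a.
        p = u if rng.random() < lam else u ** (1.0 / a)
        pvals.append(min(max(p, 1e-12), 1.0))  # keep p strictly positive
    return pvals


def bum_log_likelihood(pvals, lam, a):
    """Log-likelihood under f(p) = lam + (1 - lam) * a * p**(a - 1)."""
    return sum(math.log(lam + (1.0 - lam) * a * p ** (a - 1.0)) for p in pvals)


def fit_bum(pvals, grid=40):
    """Crude grid-search maximum likelihood for (lam, a) on (0, 1) x (0, 1).

    A real analysis would use a numerical optimizer; the grid keeps the
    sketch dependency-free.
    """
    best_ll, best_lam, best_a = -math.inf, None, None
    for i in range(1, grid):
        lam = i / grid
        for j in range(1, grid):
            a = j / grid
            ll = bum_log_likelihood(pvals, lam, a)
            if ll > best_ll:
                best_ll, best_lam, best_a = ll, lam, a
    return best_lam, best_a


def bum_fdr(tau, lam, a):
    """Estimated FDR when calling every p-value <= tau significant."""
    pi_ub = lam + (1.0 - lam) * a              # upper bound on null proportion
    cdf = lam * tau + (1.0 - lam) * tau ** a   # fitted P(p <= tau)
    return pi_ub * tau / cdf


if __name__ == "__main__":
    pvals = simulate_pvalues(1000, lam=0.8, a=0.3, seed=42)
    lam_hat, a_hat = fit_bum(pvals)
    for tau in (0.001, 0.01, 0.05):
        print(tau, round(bum_fdr(tau, lam_hat, a_hat), 3))
```

Refitting on simulated sets of different sizes, or with a different mixing weight, gives a rough feel for how the fitted parameters and the resulting FDR estimate move with sample size and with the share of non-null tests. This sketch draws independent p-values, so it does not reproduce the block correlation structure studied in the paper.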

【 License 】

CC BY   
© Zhang and Coombes; licensee BioMed Central Ltd. 2012
