期刊论文详细信息
BMC Bioinformatics
Gene set analysis for longitudinal gene expression data
Methodology Article
Youping Deng1  Solomon W Harrar2  Haiyan Wang3  Arne C Bathke4  Hans-Peter Piepho5  Ke Zhang6 
[1] Department of Internal Medicine, Rush University Medical Center, 60612, Chicago, IL, USA;Department of Mathematical Sciences, University of Montana, 59812, Missoula, MT, USA;Department of Statistics, Kansas State University, 66506, Manhattan, KS, USA;Department of Statistics, University of Kentucky, 40506, Lexington, KY, USA;Institut für Kulturpflanzenzüchtung, Universität Hohenheim, D70599, Stuttgart, Germany;School of Medicine & Health Sciences, University of North Dakota, 58202, Grand Forks, ND, USA;
关键词: Permutation Test;    Generalize Estimate Equation;    Linear Mixed Effect Model;    Cauchy Distribution;    Linear Mixed Effect;   
DOI  :  10.1186/1471-2105-12-273
 received in 2011-01-29, accepted in 2011-07-03,  发布年份 2011
来源: Springer
PDF
【 摘 要 】

BackgroundGene set analysis (GSA) has become a successful tool to interpret gene expression profiles in terms of biological functions, molecular pathways, or genomic locations. GSA performs statistical tests for independent microarray samples at the level of gene sets rather than individual genes. Nowadays, an increasing number of microarray studies are conducted to explore the dynamic changes of gene expression in a variety of species and biological scenarios. In these longitudinal studies, gene expression is repeatedly measured over time such that a GSA needs to take into account the within-gene correlations in addition to possible between-gene correlations.ResultsWe provide a robust nonparametric approach to compare the expressions of longitudinally measured sets of genes under multiple treatments or experimental conditions. The limiting distributions of our statistics are derived when the number of genes goes to infinity while the number of replications can be small. When the number of genes in a gene set is small, we recommend permutation tests based on our nonparametric test statistics to achieve reliable type I error and better power while incorporating unknown correlations between and within-genes. Simulation results demonstrate that the proposed method has a greater power than other methods for various data distributions and heteroscedastic correlation structures. This method was used for an IL-2 stimulation study and significantly altered gene sets were identified.ConclusionsThe simulation study and the real data application showed that the proposed gene set analysis provides a promising tool for longitudinal microarray analysis. R scripts for simulating longitudinal data and calculating the nonparametric statistics are posted on the North Dakota INBRE website http://ndinbre.org/programs/bioinformatics.php. Raw microarray data is available in Gene Expression Omnibus (National Center for Biotechnology Information) with accession number GSE6085.

【 授权许可】

CC BY   
© Zhang et al; licensee BioMed Central Ltd. 2011

【 预 览 】
附件列表
Files Size Format View
RO202311107752335ZK.pdf 517KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  文献评价指标  
  下载次数:12次 浏览次数:1次