期刊论文详细信息
BMC Medical Research Methodology
Permutation-based variance component test in generalized linear mixed model with application to multilocus genetic association study
Feng Chen3  Ting Wang2  Hongliang Li1  Yang Zhao3  Ping Zeng2 
[1] Center for Disease Control and Prevention of Pudong New Area, Pudong New Area, Shanghai 200136, People’s Republic of China;Department of Epidemiology and Biostatistics, Center of Medical Statistics and Data Analysis, School of Public Health, Xuzhou Medical College, Xuzhou 221004, Jiangsu, People’s Republic of China;Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing 211166, , Jiangsu, People’s Republic of China
关键词: Logistic mixed effects model;    Generalized linear mixed model;    Penalized quasi-likelihood algorithm;    Working response;    Multilocus association analysis;    Variance component;    Permutation procedure;    Likelihood ratio test;   
Others  :  1178884
DOI  :  10.1186/s12874-015-0030-1
 received in 2014-12-04, accepted in 2015-04-07,  发布年份 2015
PDF
【 摘 要 】

Background

In many medical studies the likelihood ratio test (LRT) has been widely applied to examine whether the random effects variance component is zero within the mixed effects models framework; whereas little work about likelihood-ratio based variance component test has been done in the generalized linear mixed models (GLMM), where the response is discrete and the log-likelihood cannot be computed exactly. Before applying the LRT for variance component in GLMM, several difficulties need to be overcome, including the computation of the log-likelihood, the parameter estimation and the derivation of the null distribution for the LRT statistic.

Methods

To overcome these problems, in this paper we make use of the penalized quasi-likelihood algorithm and calculate the LRT statistic based on the resulting working response and the quasi-likelihood. The permutation procedure is used to obtain the null distribution of the LRT statistic. We evaluate the permutation-based LRT via simulations and compare it with the score-based variance component test and the tests based on the mixture of chi-square distributions. Finally we apply the permutation-based LRT to multilocus association analysis in the case–control study, where the problem can be investigated under the framework of logistic mixed effects model.

Results

The simulations show that the permutation-based LRT can effectively control the type I error rate, while the score test is sometimes slightly conservative and the tests based on mixtures cannot maintain the type I error rate. Our studies also show that the permutation-based LRT has higher power than these existing tests and still maintains a reasonably high power even when the random effects do not follow a normal distribution. The application to GAW17 data also demonstrates that the proposed LRT has a higher probability to identify the association signals than the score test and the tests based on mixtures.

Conclusions

In the present paper the permutation-based LRT was developed for variance component in GLMM. The LRT outperforms existing tests and has a reasonably higher power under various scenarios; additionally, it is conceptually simple and easy to implement.

【 授权许可】

   
2015 Zeng et al.; licensee BioMed Central.

【 预 览 】
附件列表
Files Size Format View
20150505023837316.pdf 441KB PDF download
【 参考文献 】
  • [1]Self SG, Liang KY: Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. J Roy Stat Soc B 1987, 82(398):605-10.
  • [2]Stram DO, Lee JW: Variance components testing in the longitudinal mixed effects model. Biometrics 1994, 50(4):1171-7.
  • [3]Liang KY, Self SG: On the asymptotic behaviour of the pseudolikelihood ratio test statistic. J Roy Stat Soc B 1996, 58(4):785-96.
  • [4]Lindquist MA, Spicer J, Asllani I, Wager TD: Estimating and testing variance components in a multi-level GLM. Neuroimage 2012, 59(1):490-501.
  • [5]Drikvandi R, Verbeke G, Khodadadi A, Partovi Nia V: Testing multiple variance components in linear mixed-effects models. Biostatistics 2013, 14(1):144-59.
  • [6]Nobre J, Singer J, Sen P: U-tests for variance components in linear mixed models. TEST 2013, 22(4):580-605.
  • [7]Claeskens G: Restricted likelihood ratio lack-of-fit tests using mixed spline models. J Roy Stat Soc B 2004, 66(4):909-26.
  • [8]Crainiceanu CM, Ruppert D: Likelihood ratio tests in linear mixed models with one variance component. J Roy Stat Soc B 2004, 66(1):165-85.
  • [9]Crainiceanu CM, Ruppert D: Restricted likelihood ratio tests in nonparametric longitudinal models. Stat Sinica 2004, 14(3):713-30.
  • [10]Crainiceanu CM, Ruppert D: Likelihood ratio tests for goodness-of-fit of a nonlinear regression model. J Multivariate Anal 2004, 91(1):35-52.
  • [11]Crainiceanu C, Ruppert D, Claeskens G, Wand MP: Exact likelihood ratio tests for penalised splines. Biometrika 2005, 92(1):91-103.
  • [12]Greven S, Crainiceanu CM, Küchenhoff H, Peters A: Restricted likelihood ratio testing for zero variance components in linear mixed models. J Comput Graph Stat 2008, 17(4):870-91.
  • [13]Pinheiro JC, Bates D: Mixed-effects models in S and S-PLUS. 2nd edition. Springer, New York; 2009.
  • [14]Fitzmaurice GM, Lipsitz SR, Ibrahim JG: A note on permutation tests for variance components in multilevel generalized linear mixed models. Biometrics 2007, 63(3):942-6.
  • [15]Faraway JJ: Extending the linear model with R: generalized linear, mixed effects and nonparametric regression models. Chapman & Hall/CRC, New York; 2005.
  • [16]Samuh MH, Grilli L, Rampichini C, Salmaso L, Lunardon N: The use of permutation tests for variance components in linear mixed models. Commun Stat-Theor M 2012, 41(16–17):3020-9.
  • [17]Lee OE, Braun TM: Permutation tests for random effects in linear mixed models. Biometrics 2012, 68(2):486-93.
  • [18]Verbeke G, Molenberghs G: Linear mixed models for longitudinal data. Springer, New York; 2009.
  • [19]Laird NM, Ware JH: Random-effects models for longitudinal data. Biometrics 1982, 38(4):963-74.
  • [20]Breslow N, Clayton D: Approximate inference in generalized linear mixed models. J Am Stat Assoc 1993, 88(421):9-25.
  • [21]Diggle P, Heagerty P, Liang KY, Zeger S: Analysis of longitudinal data. 2nd edition. Oxford University Press, New York; 2002.
  • [22]Lin X: Variance component testing in generalised linear models with random effects. Biometrika 1997, 84(2):309-26.
  • [23]Zhang D, Lin X: Hypothesis testing in semiparametric additive mixed models. Biostatistics 2003, 4(1):57-74.
  • [24]Lin X, Zhang D: Inference in generalized additive mixed models by using smoothing splines. J Roy Stat Soc B 1999, 61(2):381-400.
  • [25]Sinha SK: Bootstrap tests for variance components in generalized linear mixed models. Can J Stat 2009, 37(2):219-34.
  • [26]Balding D: A tutorial on statistical methods for population association studies. Nat Rev Genet 2006, 7(10):781-91.
  • [27]Zeng P, Zhao Y, Liu J, Liu L, Zhang L, Wang T, et al.: Likelihood ratio tests in rare variant detection for continuous phenotypes. Ann Hum Genet 2014, 78(5):320-32.
  • [28]Tzeng J, Zhang D: Haplotype-based association analysis via variance-components score test. Am J Hum Genet 2007, 81(5):927-38.
  • [29]Kwee L, Liu D, Lin X, Ghosh D, Epstein M: A powerful and flexible multilocus association test for quantitative traits. Am J Hum Genet 2008, 82(2):386-97.
  • [30]Wu MC, Kraft P, Epstein MP, Taylor DM, Chanock SJ, Hunter DJ, et al.: Powerful SNP-set analysis for case–control genome-wide association studies. Am J Hum Genet 2010, 86(6):929-42.
  • [31]Wu MC, Lee S, Cai T, Li Y, Boehnke M, Lin X: Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet 2011, 89(1):82-93.
  • [32]Molenberghs G, Verbeke G: Models for discrete longitudinal data. Springer, New York; 2005.
  • [33]Fitzmaurice GM, Laird NM, Ware JH: Applied longitudinal analysis. John Wiley & Sons, New York; 2004.
  • [34]Wolfinger R, O’Connell M: Generalized linear mixed models a pseudo-likelihood approach. J Stat Comput Sim 1993, 48(3–4):233-43.
  • [35]Venables WN, Ripley BD: Modern applied statistics with S. 4th edition. Springer, New York; 2002.
  • [36]Davison AC, Hinkley DV: Bootstrap methods and their application. Cambridge University Press, Cambridge; 1997.
  • [37]Efron B, Tibshirani R: An introduction to the bootstrap. Chapman & Hall/CRC, New York; 1993.
  • [38]Good P: Permutation, parametric, and bootstrap tests of hypotheses. 3rd edition. Springer, New York; 2005.
  • [39]Goeman JJ, van de Geer SA, de Kort F, van Houwelingen HC: A global test for groups of genes: testing association with a clinical outcome. Bioinformatics 2004, 20(1):93-9.
  • [40]Lee S, Miropolsky L, Wu M: SKAT: SNP-set (Sequence) kernel association test. R package version 0.91. 2013.
  • [41]R Core Team: R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria; 2013.
  • [42]Schaffner SF, Foo C, Gabriel S, Reich D, Daly MJ, Altshuler D: Calibrating a coalescent simulation of human genome sequence variation. Genome Res 2005, 15(11):1576-83.
  • [43]Almasy L, Dyer T, Peralta J, Kent J, Charlesworth J, Curran J, et al.: Genetic analysis workshop 17 mini-exome simulation. BMC Proc 2011, 5(Suppl 9):S2. BioMed Central Full Text
  • [44]The 1000 Genomes Project Consortium: A map of human genome variation from population-scale sequencing Nature 2010, 467(7319):1061-73.
  • [45]Revolution Analytics: doMC: Foreach parallel adaptor for the multicore package. R package version 1.3.3. 2014.
  • [46]Li M, He Z, Zhang M, Zhan X, Wei C, Elston RC, et al.: A generalized genetic random field method for the genetic association analysis of sequencing data. Genet Epidemiol 2014, 38(3):242-53.
  • [47]Breslow NE, Lin X: Bias correction in generalised linear mixed models with a single component of dispersion. Biometrika 1995, 82(1):81-91.
  • [48]Lin X, Breslow NE: Bias correction in generalized linear mixed models with multiple components of dispersion. J Am Stat Assoc 1996, 91(435):1007-16.
  • [49]Bansal V, Libiger O, Torkamani A, Schork NJ: Statistical analysis strategies for association studies involving rare variants. Nat Rev Genet 2010, 11(11):773-85.
  • [50]Lee S, Abecasis Gonçalo R, Boehnke M, Lin X: Rare-variant association analysis: study designs and statistical tests. Am J Hum Genet 2014, 95(1):5-23.
  • [51]Moutsianas L, Morris AP. Methodology for the analysis of rare genetic variation in genome-wide association and re-sequencing studies of complex human traits. Brief Funct Genomics. 2014. doi: 10.1093/bfgp/elu012.
  • [52]Derkach A, Lawless JF, Sun L: Pooled association tests for rare genetic variants: a review and some new results. Stat Sci 2014, 29(2):302-21.
  文献评价指标  
  下载次数:2次 浏览次数:5次