| BMC Genetics | |
| Empirical Bayesian LASSO-logistic regression for multiple binary trait locus mapping | |
| Xiaodong Cai1  Shizhong Xu2  Anhui Huang1  | |
| [1] Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL 33146, USA;Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA | |
| 关键词: Logistic regression; Bayesian shrinkage; Epistatic effects; Binary traits; QTL mapping; | |
| Others : 1087378 DOI : 10.1186/1471-2156-14-5 |
|
| received in 2012-10-01, accepted in 2013-02-06, 发布年份 2013 | |
PDF
|
|
【 摘 要 】
Background
Complex binary traits are influenced by many factors including the main effects of many quantitative trait loci (QTLs), the epistatic effects involving more than one QTLs, environmental effects and the effects of gene-environment interactions. Although a number of QTL mapping methods for binary traits have been developed, there still lacks an efficient and powerful method that can handle both main and epistatic effects of a relatively large number of possible QTLs.
Results
In this paper, we use a Bayesian logistic regression model as the QTL model for binary traits that includes both main and epistatic effects. Our logistic regression model employs hierarchical priors for regression coefficients similar to the ones used in the Bayesian LASSO linear model for multiple QTL mapping for continuous traits. We develop efficient empirical Bayesian algorithms to infer the logistic regression model. Our simulation study shows that our algorithms can easily handle a QTL model with a large number of main and epistatic effects on a personal computer, and outperform five other methods examined including the LASSO, HyperLasso, BhGLM, RVM and the single-QTL mapping method based on logistic regression in terms of power of detection and false positive rate. The utility of our algorithms is also demonstrated through analysis of a real data set. A software package implementing the empirical Bayesian algorithms in this paper is freely available upon request.
Conclusions
The EBLASSO logistic regression method can handle a large number of effects possibly including the main and epistatic QTL effects, environmental effects and the effects of gene-environment interactions. It will be a very useful tool for multiple QTLs mapping for complex binary traits.
【 授权许可】
2013 Huang et al; licensee BioMed Central Ltd.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 20150116025615511.pdf | 353KB |
【 参考文献 】
- [1]Falconer DS, Mackay TFC: Introduction to Quantitative Genetics. 4th edition. Boston: Addison-Wesley; 1996.
- [2]Hackett CA, Weller JI: Genetic mapping of quantitative trait loci for traits with ordinal distributions. Biometrics 1995, 51(4):1252-1263.
- [3]Xu S, Atchley WR: Mapping quantitative trait loci for complex binary diseases using line crosses. Genetics 1996, 143(3):1417-1424.
- [4]Rao S, Xu S: Mapping quantitative trait loci for ordered categorical traits in four-way crosses. Heredity 1998, 81(2):214-224.
- [5]Xu S, Yi N, Burke D, Galecki A, Miller RA: An EM algorithm for mapping binary disease loci: application to fibrosarcoma in a four-way cross mouse family. Genet Res 2003, 82(2):127-138.
- [6]Xu C, Zhang YM, Xu S: An EM algorithm for mapping quantitative resistance loci. Heredity 2004, 94(1):119-128.
- [7]Xu C, Li Z, Xu S: Joint mapping of quantitative trait loci for multiple binary characters. Genetics 2005, 169(2):1045-1059.
- [8]Deng W, Chen H, Li Z: A logistic regression mixture model for interval mapping of genetic trait loci affecting binary phenotypes. Genetics 2006, 172(2):1349-1358.
- [9]Haley CS, Knott SA: A simple regression method for mapping quantitative trait loci in line crosses using flanking markers. Heredity 1992, 69(4):315-324.
- [10]Martínez O, Curnow RN: Estimating the locations and the sizes of the effects of quantitative trait loci using flanking markers. Theor Appl Genet 1992, 85(4):480-488.
- [11]Yi N, Xu S: Bayesian mapping of quantitative trait loci for complex binary traits. Genetics 2000, 155(3):1391-1403.
- [12]Yi N, Xu S: Mapping quantitative trait loci with epistatic effects. Genet Res 2002, 79(2):185-198.
- [13]Yi N, Xu S, George V, Allison DB: Mapping multiple quantitative trait loci for ordinal traits. Behav Genet 2004, 34(1):3-15.
- [14]Yi N, Banerjee S, Pomp D, Yandell BS: Bayesian mapping of genomewide interacting quantitative trait loci for ordinal traits. Genetics 2007, 176(3):1855-1864.
- [15]Huang H, Eversley CD, Threadgill DW, Zou F: Bayesian multiple quantitative trait loci mapping for complex traits using markers of the entire genome. Genetics 2007, 176(4):2529-2540.
- [16]Yang R, Li J, Wang X, Zhou X: Bayesian functional mapping of dynamic quantitative traits. Theor Appl Genet 2011, 123(3):483-492.
- [17]Chen Z, Liu J: Mixture generalized linear models for multiple interval mapping of quantitative trait loci in experimental crosses. Biometrics 2009, 65(2):470-477.
- [18]Li J, Wang S, Zeng ZB: Multiple-interval mapping for ordinal traits. Genetics 2006, 173(3):1649-1663.
- [19]Coffman CJ, Doerge RW, Simonsen KL, Nichols KM, Duarte CK, Wolfinger RD, McIntyre LM: Model selection in binary trait locus mapping. Genetics 2005, 170(3):1281-1297.
- [20]Xu S: Estimating polygenic effects using markers of the entire genome. Genetics 2003, 163(2):789-801.
- [21]Wang H, Zhang YM, Li X, Masinde GL, Mohan S, Baylink DJ, Xu S: Bayesian shrinkage estimation of quantitative trait loci parameters. Genetics 2005, 170:465-480.
- [22]Hoti F, Sillanpää MJ: Bayesian mapping of genotype x expression interactions in quantitative and qualitative traits. Heredity 2006, 97(1):4-18.
- [23]Yi N, Xu S: Bayesian LASSO for quantitative trait loci mapping. Genetics 2008, 179(2):1045-1055.
- [24]Xu S: An empirical Bayes method for estimating epistatic effects of quantitative trait loci. Biometrics 2007, 63(2):513-521.
- [25]Hoggart CJ, Whittaker JC, De Iorio M, Balding DJ: Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies. PLoS Genet 2008, 4(7):e1000130.
- [26]Yi N, Banerjee S: Hierachical generalized linear models for multiple quantitative trait locus mapping. Genetics 2009, 181:1101-1133.
- [27]Tibshirani R: Regression shrinkage and selection via the lasso. J R Stat Soc Series B Stat Methodol 1996, 58(1):267-288.
- [28]Wu TT, Chen YF, Hastie T, Sobel E, Lange K: Genome-wide association analysis by lasso penalized logistic regression. Bioinformatics 2009, 25:714-721.
- [29]Ayers KL, Cordell HJ: SNP selection in genome-wide and candidate gene studies via penalized logistic regression. Genet Epidemiol 2010, 34(8):879-891.
- [30]Cai X, Huang A, Xu S: Fast empirical Bayesian LASSO for multiple quantitative trait locus mapping. BMC Bioinformatics 2011, 12(1):211. BioMed Central Full Text
- [31]Park T, Casella G: The Bayesian lasso. J Am Stat Assoc 2008, 103(482):681-686.
- [32]Tipping ME: Sparse Bayesian learning and the relevance vector machine. J Mach Learn Res 2001, 1(3):211-244.
- [33]Tipping ME, Faul AC: Fast marginal likelihood maximisation for sparse Bayesian models. Key West, FL: Proc 9th International Workshop on Artificial Intelligence and Statistics; 2003.
- [34]Cockerham CC: An extension of the concept of partitioning hereditary variance for analysis of covariances among relatives when epistasis is present. Genetics 1954, 39(6):859-882.
- [35]Griffin JE, Brown PJ: Bayesian hyper-lassos with non-convex penalization. Aust N Z J Stat 2011, 53(4):423-442.
- [36]Carlin BP, Louis TA: Bayesian methods for data analysis. 3rd edition. London/New York: Chapman & Hall/CRC; 2008.
- [37]Bishop CM: Pattern recognition and machine learning. New York: Springer; 2006.
- [38]MacKay DJC: The evidence framework applied to classification networks. Neural Comput 1992, 4(5):720-736.
- [39]Hastie T, Tibshirani R, Friedman JH: The elements of statistical learning: data mining, inference, and prediction. 2nd edition. New York: Springer; 2009.
- [40]Wu R, Ma CX, Casella G: Statistical genetics of quantitative traits: linkage, maps, and QTL. LLC: Springer Science + Business Media; 2007.
- [41]R Development Core Team: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2012.
- [42]Friedman J, Hastie T, Tibshirani R: Regularization paths for generalized linear models via coordinate descent. J Stat software 2010, 33(1):1-22.
- [43]Masinde GL, Li X, Gu W, Davidson H, Mohan S, Baylink DJ: Identification of wound healing/regeneration Quantitative Trait Loci (QTL) at multiple time points that explain seventy percent of variance in (MRL/MpJ and SJL/J) mice F2 population. Genome Res 2001, 11(12):2027-2033.
- [44]Li X, Mohan S, Gu W, Baylink DJ: Analysis of gene expression in the wound repair/regeneration process. Mamm Genome 2001, 12(1):52-59.
- [45]Kunimoto BT: Growth factors in wound healing: the next great innovation? Ostomy Wound Manage 1999, 45(8):56-64.
- [46]Xu S: An expectation maximization algorithm for the Lasso estimation of quantitative trait locus effects. Heredity 2010, 2010:1-12.
- [47]Yi N, Shriner D, Banerjee S, Mehta T, Pomp D, Yandell BS: An efficient Bayesian model selection approach for interacting quantitative trait loci models with many effects. Genetics 2007, 176(3):1865-1877.
- [48]Fan J, Song R: Sure independence screening in generalized linear models with NP-dimensionality. Ann Stat 2010, 38(6):3567-3604.
- [49]Fan J, Lv J: Sure independence screening for ultrahigh dimensional feature space. J R Stat Soc Series B Stat Methodol 2008, 70(5):849-911.
PDF