期刊论文详细信息
BMC Bioinformatics
A Bayesian method and its variational approximation for prediction of genomic breeding values in multiple traits
Takeshi Hayashi2  Hiroyoshi Iwata1 
[1] Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo, Tokyo, 113-8657, Japan
[2] Agroinformatics Division, National Agriculture and Food Research Organization, Agricultural Research Center, Kannondai, Tsukuba, Ibaraki, 305-8666, Japan
关键词: Variational approximation;    MCMC iteration;    Bayesian regression;    Multiple traits;    Genomic selection;   
Others  :  1088000
DOI  :  10.1186/1471-2105-14-34
 received in 2012-08-22, accepted in 2013-01-22,  发布年份 2013
PDF
【 摘 要 】

Background

Genomic selection is an effective tool for animal and plant breeding, allowing effective individual selection without phenotypic records through the prediction of genomic breeding value (GBV). To date, genomic selection has focused on a single trait. However, actual breeding often targets multiple correlated traits, and, therefore, joint analysis taking into consideration the correlation between traits, which might result in more accurate GBV prediction than analyzing each trait separately, is suitable for multi-trait genomic selection. This would require an extension of the prediction model for single-trait GBV to multi-trait case. As the computational burden of multi-trait analysis is even higher than that of single-trait analysis, an effective computational method for constructing a multi-trait prediction model is also needed.

Results

We described a Bayesian regression model incorporating variable selection for jointly predicting GBVs of multiple traits and devised both an MCMC iteration and variational approximation for Bayesian estimation of parameters in this multi-trait model. The proposed Bayesian procedures with MCMC iteration and variational approximation were referred to as MCBayes and varBayes, respectively. Using simulated datasets of SNP genotypes and phenotypes for three traits with high and low heritabilities, we compared the accuracy in predicting GBVs between multi-trait and single-trait analyses as well as between MCBayes and varBayes. The results showed that, compared to single-trait analysis, multi-trait analysis enabled much more accurate GBV prediction for low-heritability traits correlated with high-heritability traits, by utilizing the correlation structure between traits, while the prediction accuracy for uncorrelated low-heritability traits was comparable or less with multi-trait analysis in comparison with single-trait analysis depending on the setting for prior probability that a SNP has zero effect. Although the prediction accuracy with varBayes was generally lower than with MCBayes, the loss in accuracy was slight. The computational time was greatly reduced with varBayes.

Conclusions

In genomic selection for multiple correlated traits, multi-trait analysis was more beneficial than single-trait analysis and varBayes was much advantageous over MCBayes in computational time, which would outweigh the loss of prediction accuracy caused by the approximation procedure, and is thus considered a practical method of choice.

【 授权许可】

   
2013 Hayashi and Iwata; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20150117063810697.pdf 275KB PDF download
【 参考文献 】
  • [1]Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES: A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One 2011, 6(5):e19379.
  • [2]Meuwissen THE, Hayes B, Goddard ME: Prediction of total genetic value using genome-wide dense marker maps. Genetics 2001, 157:1819-1829.
  • [3]Solberg TR, Sonesson AK, Wooliams JA, Meuwissen THE: Genomic selection using different marker types and densities. J Anim Sci 2008, 86:2447-2454.
  • [4]Habier D, Fernando RL, Kizilkaya K, Garrick DJ: Extension of the Bayesian alphabet for genomic selection. BMC Bioinformatics 2011, 12:186. BioMed Central Full Text
  • [5]Xu S: Estimating polygenic effects using makers of the entire genome. Genetics 1989, 163:789-891.
  • [6]George EI, McCullogh RE: Variable selection via Gibbs sampling. J Am Stat Assoc 1993, 88:881-889.
  • [7]Gianola D, Delos Campos G, HillW G, Manfredi E, Fernando R: Additive genetic variability and the Bayesian alphabet. Genetics 2009, 183:347-363.
  • [8]Xu C, Wang X, Li Z, Xu S: Mapping QTL for multiple traits using Bayesian statistics. Genet Res 2008, 90:1-15.
  • [9]Banerjee S, Yandell BS, Yi N: Bayesian Quantitative Trait Loci Mapping for Multiple Traits. Genetics 2008, 179:2275-2289.
  • [10]Meuwissen THE, Goddard ME: Mapping multiple QTL using linkage disequilibrium and linkage analysis information and multitrait data. Genet Sel Evol 2004, 36:261-279. BioMed Central Full Text
  • [11]Carlin BP, Chib S: Bayesian model choice via Markov chain Monte Carlo. J R Stat Soc Ser B 1995, 57:473-484.
  • [12]Yi N: A unified Markov chain Monte Carlo framework for mapping multiple quantitative trait loci. Genetics 2004, 167:967-975.
  • [13]Calus MP, Veerkamp RF: Accuracy of multi-trait genomic selection using different methods. Genet Sel Evol 2011, 43:26. BioMed Central Full Text
  • [14]Yi N, Banerjee S: Hierarchical generalized linear models for multiple quantitative trait locus mapping. Genetics 2009, 181:1101-1113.
  • [15]Logsdon BA, Hoffman G, Mezey J: A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis. BMC Bioinformatics 2010, 11:58. BioMed Central Full Text
  • [16]Li Z, Sillanpää MJ: Estimation of quantitative trait locus effects with epistasis by variational Bayes algorithms. Genetics 2012, 190:231-249.
  • [17]Hayashi T, Iwata H: EM algorithm for Bayesian estimation of genomic breeding values. BMC Genet 2010, 11:3.
  • [18]Shepherd RK, Meuwissen THE, Wooliams JA: Genomic selection and complex trait prediction using a fast EM algorithm applied to genome-wide markers. BMC Bioinformatics 2010, 11:529. BioMed Central Full Text
  • [19]Attias H: Inferring parameters and structure of latent variable models by variational Bayes. San Francisco, CA: Morgan-Kaufmann; 1999:21-30. [Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence]
  • [20]Bishop CM: Pattern recognition and machine learning. New York: Springer--Verlag; 2006.
  • [21]Beal M: Variational algorithms for approximate Bayesian inference. University of London: PhD thesis; 2003.
  • [22]VanTassel CP, VanVleck LD: Multiple-trait Gibbs sampler for animal models: Flexible programs for Bayesian and likelihood-based (co)variance component inference. J Anim Sci 1996, 74:2586-2597.
  • [23]Mathai AM, Moschopoulos PG: A form of multivariate gamma distribution. Ann Inst Stat Math 1992, 44:97-106.
  • [24]Moser G, Tier B, Crump RE, Khatkar MS, Raadsma HW: A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers. Genet Sel Evol 2009, 41:56. BioMed Central Full Text
  • [25]Luan T, Woolliams JA, Lien S, Kent M, Svendsen M, Meuwissen THE: The accuracy of genomic selection in Norwegian red cattle assessed by cross-validation. Genetics 2009, 183:1119-1126.
  • [26]Saatchi M, McClure MC, McKay SD, Megan M, Rolf MM, Kim JW, Decker JE, Taxis TM, Chapple RH, Ramey HR, et al.: Accuracies of genomic breeding values in American Angus beef cattle using K-means clustering for cross-validation. Genet Sel Evol 2011, 43:40. BioMed Central Full Text
  • [27]Lorenzana RE, Bernardo R: Accuracy of genotypic value predictions for marker-based selection in biparental plant populations. Theor Appl Genet 2009, 120:151-161.
  • [28]Crossa J, Delos Campos G, Pérez P, Gianola D, Burgueño J, Araus JL, Makumbi D, Singh RP, Dreisigacker S, Yan J, et al.: Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers. Genetics 2010, 186:713-724.
  • [29]Heffner EL, Jannink JL, Iwata H, Souza E, Sorrells ME: Genomic selection accuracy for grain quality traits in biparental wheat populations. Crop Sci 2011, 51:2597-2606.
  • [30]Dekkers JCM: Prediction of response to marker-assisted and genomic selection using selection index theory. J Anim Breed Genet 2007, 124:331-341.
  • [31]Kärkkäinen HP, Sillanpää MJ: Robustness of Bayesian multilocus association models to cryptic relatedness. Hum Genet 2012, 76:510-523.
  文献评价指标  
  下载次数:12次 浏览次数:15次