BMC Genomics | |
Improvement of prediction ability by integrating multi-omic datasets in barley | |
Philipp Westhoff1  Alexander Erban2  Marius Weisweiler3  Benjamin Stich3  Delphine Van Inghelandt3  Asis Shrestha3  Po-Ya Wu3  | |
[1] Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine University;Department of Molecular Physiology, Max-Planck-Institute of Molecular Plant Physiology;Institute of Quantitative Genetics and Genomics of Plants, Heinrich Heine University; | |
关键词: Barley; Deleterious SV; Transcriptome; Metabolome; Genomic prediction; Omic prediction; | |
DOI : 10.1186/s12864-022-08337-7 | |
来源: DOAJ |
【 摘 要 】
Abstract Background Genomic prediction (GP) based on single nucleotide polymorphisms (SNP) has become a broadly used tool to increase the gain of selection in plant breeding. However, using predictors that are biologically closer to the phenotypes such as transcriptome and metabolome may increase the prediction ability in GP. The objectives of this study were to (i) assess the prediction ability for three yield-related phenotypic traits using different omic datasets as single predictors compared to a SNP array, where these omic datasets included different types of sequence variants (full-SV, deleterious-dSV, and tolerant-tSV), different types of transcriptome (expression presence/absence variation-ePAV, gene expression-GE, and transcript expression-TE) sampled from two tissues, leaf and seedling, and metabolites (M); (ii) investigate the improvement in prediction ability when combining multiple omic datasets information to predict phenotypic variation in barley breeding programs; (iii) explore the predictive performance when using SV, GE, and ePAV from simulated 3’end mRNA sequencing of different lengths as predictors. Results The prediction ability from genomic best linear unbiased prediction (GBLUP) for the three traits using dSV information was higher than when using tSV, all SV information, or the SNP array. Any predictors from the transcriptome (GE, TE, as well as ePAV) and metabolome provided higher prediction abilities compared to the SNP array and SV on average across the three traits. In addition, some (di)-similarity existed between different omic datasets, and therefore provided complementary biological perspectives to phenotypic variation. Optimal combining the information of dSV, TE, ePAV, as well as metabolites into GP models could improve the prediction ability over that of the single predictors alone. Conclusions The use of integrated omic datasets in GP model is highly recommended. Furthermore, we evaluated a cost-effective approach generating 3’end mRNA sequencing with transcriptome data extracted from seedling without losing prediction ability in comparison to the full-length mRNA sequencing, paving the path for the use of such prediction methods in commercial breeding programs.
【 授权许可】
Unknown