期刊论文详细信息
BMC Bioinformatics
Whole genome SNP genotype piecemeal imputation
Methodology Article
Paul Stothard1  Yining Wang2  Guohui Lin2  Tim Wylie3 
[1] Department of Agricultural, Food, and Nutritional Science, University of Alberta, T6G 2C8, Edmonton, Alberta, Canada;Department of Computing Science, University of Alberta, Alberta T6G 2E8, Edmonton, Canada;Department of Computing Science, University of Alberta, Alberta T6G 2E8, Edmonton, Canada;Currently with Department of Computer Science, University of Texas – Rio Grande Valley, 78539, Edinburg, Texas, USA;
关键词: Single Nucleotide Polymorphism Genotype;    Imputation Accuracy;    Genotype Imputation;    Density Single Nucleotide Polymorphism;    Genotyped Animal;   
DOI  :  10.1186/s12859-015-0770-2
 received in 2015-01-19, accepted in 2015-10-09,  发布年份 2015
来源: Springer
PDF
【 摘 要 】

BackgroundDespite ongoing reductions in the cost of sequencing technologies, whole genome SNP genotype imputation is often used as an alternative for obtaining abundant SNP genotypes for genome wide association studies. Several existing genotype imputation methods can be efficient for this purpose, while achieving various levels of imputation accuracy. Recent empirical results have shown that the two-step imputation may improve accuracy by imputing the low density genotyped study animals to a medium density array first and then to the target density. We are interested in building a series of staircase arrays that lead the low density array to the high density array or even the whole genome, such that genotype imputation along these staircases can achieve the highest accuracy.ResultsFor genotype imputation from a lower density to a higher density, we first show how to select untyped SNPs to construct a medium density array. Subsequently, we determine for each selected SNP those untyped SNPs to be imputed in the add-one two-step imputation, and lastly how the clusters of imputed genotype are pieced together as the final imputation result. We design extensive empirical experiments using several hundred sequenced and genotyped animals to demonstrate that our novel two-step piecemeal imputation always achieves an improvement compared to the one-step imputation by the state-of-the-art methods Beagle and FImpute. Using the two-step piecemeal imputation, we present some preliminary success on whole genome SNP genotype imputation for genotyped animals via a series of staircase arrays.ConclusionsFrom a low SNP density to the whole genome, intermediate pseudo-arrays can be computationally constructed by selecting the most informative SNPs for untyped SNP genotype imputation. Such pseudo-array staircases are able to impute more accurately than the classic one-step imputation.

【 授权许可】

CC BY   
© Wang et al. 2015

【 预 览 】
附件列表
Files Size Format View
RO202311101491361ZK.pdf 690KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  文献评价指标  
  下载次数:3次 浏览次数:0次