BMC Genomics | |
Variant calling in low-coverage whole genome sequencing of a Native American population sample | |
Research Article | |
Scott A Chasse1  Yun Li1  Cindy L Ehlers2  Ian R Gizer3  Chris Bizon4  Michael Spiegel4  Xiaoshu Wang4  Josh K Sailsbery4  Kirk C Wilhelmsen5  Piotr A Mieczkowski6  Ewa P Malc6  | |
[1] Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, USA;Department of Molecular and Cellular Neuroscience, The Scripps Research Institute, La Jolla, USA;Department of Psychological Sciences, University of Missouri-Columbia, Columbia, USA;Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, USA;Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, USA;Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, USA;UNC High Throughput Sequencing Facility, Chapel Hill, USA; | |
关键词: Whole Genome Sequencing; Variant Calling; Variant Call; Kinship Coefficient; Whole Genome Sequencing Data; | |
DOI : 10.1186/1471-2164-15-85 | |
received in 2013-07-26, accepted in 2014-01-24, 发布年份 2014 | |
来源: Springer | |
【 摘 要 】
BackgroundThe reduction in the cost of sequencing a human genome has led to the use of genotype sampling strategies in order to impute and infer the presence of sequence variants that can then be tested for associations with traits of interest. Low-coverage Whole Genome Sequencing (WGS) is a sampling strategy that overcomes some of the deficiencies seen in fixed content SNP array studies. Linkage-disequilibrium (LD) aware variant callers, such as the program Thunder, may provide a calling rate and accuracy that makes a low-coverage sequencing strategy viable.ResultsWe examined the performance of an LD-aware variant calling strategy in a population of 708 low-coverage whole genome sequences from a community sample of Native Americans. We assessed variant calling through a comparison of the sequencing results to genotypes measured in 641 of the same subjects using a fixed content first generation exome array. The comparison was made using the variant calling routines GATK Unified Genotyper program and the LD-aware variant caller Thunder. Thunder was found to improve concordance in a coverage dependent fashion, while correctly calling nearly all of the common variants as well as a high percentage of the rare variants present in the sample.ConclusionsLow-coverage WGS is a strategy that appears to collect genetic information intermediate in scope between fixed content genotyping arrays and deep-coverage WGS. Our data suggests that low-coverage WGS is a viable strategy with a greater chance of discovering novel variants and associations than fixed content arrays for large sample association analyses.
【 授权许可】
Unknown
© Bizon et al.; licensee BioMed Central Ltd. 2014. This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311091632876ZK.pdf | 880KB | download |
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]