BMC Bioinformatics | |
Fast-GBS: a new pipeline for the efficient and highly accurate calling of SNPs from genotyping-by-sequencing data | |
Software | |
Amina Abed1  Maxime Bastien1  Davoud Torkamaneh1  François Belzile1  Jérôme Laroche2  | |
[1] Département de Phytologie, Université Laval, Quebec City, QC, Canada;Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec City, QC, Canada;Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec City, QC, Canada; | |
关键词: NGS; GBS; Bioinformatics pipeline; SNP; Genotype accuracy; | |
DOI : 10.1186/s12859-016-1431-9 | |
received in 2016-06-10, accepted in 2016-12-16, 发布年份 2017 | |
来源: Springer | |
【 摘 要 】
BackgroundNext-generation sequencing (NGS) technologies have accelerated considerably the investigation into the composition of genomes and their functions. Genotyping-by-sequencing (GBS) is a genotyping approach that makes use of NGS to rapidly and economically scan a genome. It has been shown to allow the simultaneous discovery and genotyping of thousands to millions of SNPs across a wide range of species. For most users, the main challenge in GBS is the bioinformatics analysis of the large amount of sequence information derived from sequencing GBS libraries in view of calling alleles at SNP loci. Herein we describe a new GBS bioinformatics pipeline, Fast-GBS, designed to provide highly accurate genotyping, to require modest computing resources and to offer ease of use.ResultsFast-GBS is built upon standard bioinformatics language and file formats, is capable of handling data from different sequencing platforms, is capable of detecting different kinds of variants (SNPs, MNPs, and Indels). To illustrate its performance, we called variants in three collections of samples (soybean, barley, and potato) that cover a range of different genome sizes, levels of genome complexity, and ploidy. Within these small sets of samples, we called 35 k, 32 k and 38 k SNPs for soybean, barley and potato, respectively. To assess genotype accuracy, we compared these GBS-derived SNP genotypes with independent data sets obtained from whole-genome sequencing or SNP arrays. This analysis yielded estimated accuracies of 98.7, 95.2, and 94% for soybean, barley, and potato, respectively.ConclusionsWe conclude that Fast-GBS provides a highly efficient and reliable tool for calling SNPs from GBS data.
【 授权许可】
CC BY
© The Author(s). 2017
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311099524204ZK.pdf | 532KB | download |
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]
- [35]
- [36]
- [37]
- [38]
- [39]
- [40]
- [41]
- [42]