期刊论文详细信息
BMC Genomics
A benchmark study on error-correction by read-pairing and tag-clustering in amplicon-based deep sequencing
Research Article
Ren Sun1  Nicholas C. Wu2  Tian-Hao Zhang3 
[1] Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, 90095, Los Angeles, CA, USA;Molecular Biology Institute, University of California, 90095, Los Angeles, CA, USA;Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, 90095, Los Angeles, CA, USA;Molecular Biology Institute, University of California, 90095, Los Angeles, CA, USA;Department of Integrative Structural and Computational Biology, The Scripps Research Institute, 92037, La Jolla, CA, USA;Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, 90095, Los Angeles, CA, USA;School of Life Science, Fudan University, 200433, Shanghai, China;Molecular Biology Institute, University of California, 90095, Los Angeles, CA, USA;
关键词: Deep sequencing;    Amplicon sequencing;    Error-correction;    Tag-clustering;    Read-pairing;    Error rate;   
DOI  :  10.1186/s12864-016-2388-9
 received in 2015-09-17, accepted in 2016-01-08,  发布年份 2016
来源: Springer
PDF
【 摘 要 】

BackgroundThe high error rate of next generation sequencing (NGS) restricts some of its applications, such as monitoring virus mutations and detecting rare mutations in tumors. There are two commonly employed sequencing library preparation strategies to improve sequencing accuracy by correcting sequencing errors: read-pairing method and tag-clustering method (i.e. primer ID or UID). Here, we constructed a homogeneous library from a single clone, and compared the variant calling accuracy of these error-correction methods.ResultWe comprehensively described the strengths and pitfalls of these methods. We found that both read-pairing and tag-clustering methods significantly decreased sequencing error rate. While the read-pairing method was more effective than the tag-clustering method at correcting insertion and deletion errors, it was not as effective as the tag-clustering method at correcting substitution errors. In addition, we observed that when the read quality was poor, the tag-clustering method led to huge coverage loss. We also tested the effect of applying quality score filtering to the error-correction methods and demonstrated that quality score filtering was able to impose a minor, yet statistically significant improvement to the error-correction methods tested in this study.ConclusionOur study provides a benchmark for researchers to select suitable error-correction methods based on the goal of the experiment by balancing the trade-off between sequencing cost (i.e. sequencing coverage requirement) and detection sensitivity.

【 授权许可】

CC BY   
© Zhang et al. 2016

【 预 览 】
附件列表
Files Size Format View
RO202311093968781ZK.pdf 1692KB PDF download
12864_2017_3733_Article_IEq39.gif 1KB Image download
12864_2015_1944_Article_IEq10.gif 1KB Image download
12864_2017_3676_Article_IEq1.gif 1KB Image download
12864_2016_2388_Article_IEq4.gif 1KB Image download
12864_2017_3733_Article_IEq43.gif 1KB Image download
12864_2017_3781_Article_IEq3.gif 1KB Image download
【 图 表 】

12864_2017_3781_Article_IEq3.gif

12864_2017_3733_Article_IEq43.gif

12864_2016_2388_Article_IEq4.gif

12864_2017_3676_Article_IEq1.gif

12864_2015_1944_Article_IEq10.gif

12864_2017_3733_Article_IEq39.gif

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  文献评价指标  
  下载次数:32次 浏览次数:0次