学位论文详细信息
Error Correction of Second-Generation Sequencing Reads
bioinformatics;sequencing;error correction;Computer Science
Marinier, Eric
University of Waterloo
关键词: bioinformatics;    sequencing;    error correction;    Computer Science;   
Others  :  https://uwspace.uwaterloo.ca/bitstream/10012/8996/1/Marinier_Eric.pdf
瑞士|英语
来源: UWSPACE Waterloo Institutional Repository
PDF
【 摘 要 】

The introduction of second-generation DNA sequencers has enabled researchers to explore biological information in ways never before possible. These sequencers provide increased throughput over first-generation sequencers at decreasing costs. However, the information produced by these sequencing technologies contains errors which may complicate downstream analyses. The error correction problem involves locating sequencing errors and making edits that correct or remove errors. We introduce Pollux, a platform-independent error corrector which identifies and fixes errors produced by second-generation sequencing technologies. We evaluate Pollux on several diploid bacterial data sets. Using standardized test data, Pollux corrects 85% of Roche 454 GS Junior, 86% of Ion Torrent PGM, and 94% of Illumina MiSeq errors. We compare Pollux to several current error correctors. Pollux performs comparably with the most effective correctors when correcting Illumina data and makes significant improvements when correcting Roche 454 and Ion Torrent PGM data. Furthermore, we provide evidence that Pollux can correct errors in the presence of varying coverage and improves the quality of sequence assemblies.

【 预 览 】
附件列表
Files Size Format View
Error Correction of Second-Generation Sequencing Reads 514KB PDF download
  文献评价指标  
  下载次数:9次 浏览次数:30次