期刊论文详细信息
BMC Bioinformatics
SequelTools: a suite of tools for working with PacBio Sequel raw sequence data
Matthew B. Hufford1  David E. Hufnagel2  Arun S. Seetharam3 
[1] Department of Ecology, Evolution and Organismal Biology, Iowa State University, 50011, Ames, IA, USA;Department of Ecology, Evolution and Organismal Biology, Iowa State University, 50011, Ames, IA, USA;Virus and Prion Research Unit, National Animal Disease Center, USDA-ARS, 50010, Ames, IA, USA;Genome Informatics Facility, Iowa State University, 50011, Ames, IA, USA;
关键词: Genomics;    Next-generation sequencing;    Third-generation sequencing;    PacBio;    Sequel;   
DOI  :  10.1186/s12859-020-03751-8
来源: Springer
PDF
【 摘 要 】

BackgroundPacBio sequencing is an incredibly valuable third-generation DNA sequencing method due to very long read lengths, ability to detect methylated bases, and its real-time sequencing methodology. Yet, hitherto no tool was available for analyzing the quality of, subsampling, and filtering PacBio data.ResultsHere we present SequelTools, a command-line program containing three tools: Quality Control, Read Subsampling, and Read Filtering. The Quality Control tool quickly processes PacBio Sequel raw sequence data from multiple SMRTcells producing multiple statistics and publication-quality plots describing the quality of the data including N50, read length and count statistics, PSR, and ZOR. The Read Subsampling tool allows the user to subsample reads by one or more of the following criteria: longest subreads per CLR or random CLR selection. The Read Filtering tool provides options for normalizing data by filtering out certain low-quality scraps reads and/or by minimum CLR length. SequelTools is implemented in bash, R, and Python using only standard libraries and packages and is platform independent.ConclusionsSequelTools is a program that provides the only free, fast, and easy-to-use quality control tool, and the only program providing this kind of read subsampling and read filtering for PacBio Sequel raw sequence data, and is available at https://github.com/ISUgenomics/SequelTools.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202104269553331ZK.pdf 1916KB PDF download
  文献评价指标  
  下载次数:18次 浏览次数:1次