期刊论文详细信息
Wellcome Open Research
Ultraplex: A rapid, flexible, all-in-one fastq demultiplexer
article
Oscar G Wilkins1  Charlotte Capitanchik1  Nicholas M. Luscombe1  Jernej Ule1 
[1] The Francis Crick Institute;Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology;UCL Genetics Institute, Department of Genetics, Environment and Evolution, University College London;Okinawa Institute of Science & Technology Graduate University
关键词: Demultiplexing;    fastq;    iCLIP;    UMI;    ribosome profiling;   
DOI  :  10.12688/wellcomeopenres.16791.1
学科分类:内科医学
来源: Wellcome
PDF
【 摘 要 】

Background: The first step of virtually all next generation sequencing analysis involves the splitting of the raw sequencing data into separate files using sample-specific barcodes, a process known as “demultiplexing”. However, we found that existing software for this purpose was either too inflexible or too computationally intensive for fast, streamlined processing of raw, single end fastq files containing combinatorial barcodes.Results: Here, we introduce a fast and uniquely flexible demultiplexer, named Ultraplex, which splits a raw FASTQ file containing barcodes either at a single end or at both 5’ and 3’ ends of reads, trims the sequencing adaptors and low-quality bases, and moves unique molecular identifiers (UMIs) into the read header, allowing subsequent removal of PCR duplicates. Ultraplex is able to perform such single or combinatorial demultiplexing on both single- and paired-end sequencing data, and can process an entire Illumina HiSeq lane, consisting of nearly 500 million reads, in less than 20 minutes.Conclusions: Ultraplex greatly reduces computational burden and pipeline complexity for the demultiplexing of complex sequencing libraries, such as those produced by various CLIP and ribosome profiling protocols, and is also very user friendly, enabling streamlined, robust data processing. Ultraplex is available on PyPi and Conda and viaGithub.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202307130000994ZK.pdf 1927KB PDF download
  文献评价指标  
  下载次数:2次 浏览次数:0次