Thesis details
Parallel algorithms for enabling fast and scalable analysis of high-throughput sequencing datasets
Jammula, Nagakishore; Aluru, Srinivas; Electrical and Computer Engineering; Vuduc, Richard; Qureshi, Moinuddin; Wills, Linda; Gavrilovska, Ada; Aluru, Srinivas
University: Georgia Institute of Technology
Department: Electrical and Computer Engineering
Keywords: High-throughput sequencing; Parallel algorithms
Others: https://smartech.gatech.edu/bitstream/1853/61740/1/JAMMULA-DISSERTATION-2019.pdf
United States | English
Source: SMARTech Repository
【 Abstract 】

The objective of this research is to develop parallel algorithms for enabling fast and scalable analysis of large-scale high-throughput sequencing datasets. The genome of an organism consists of one or more long DNA sequences called chromosomes, each a sequence of bases. Depending on the organism, the length of the genome can vary from several thousand bases to several billion bases. Genome sequencing, which involves deciphering the sequence of bases of the genome, is an important tool in genomics research. Sequencing instruments widely deployed today can only read short DNA sequences. However, these instruments can read up to several billion such sequences at a time, and are used to sequence a large number of randomly generated short fragments from the genome. These fragments are a few hundred bases long and are commonly referred to as “reads”. This work specifically tackles three problems associated with high-throughput short-read sequencing datasets: (1) parallel read error correction for large-scale genomics datasets, (2) partitioning of large-scale high-throughput sequencing datasets, and (3) parallel compression of large-scale genomics datasets.

【 Preview 】
Attachments
Files | Size | Format | View
Parallel algorithms for enabling fast and scalable analysis of high-throughput sequencing datasets | 957KB | PDF | download