期刊论文详细信息
BMC Bioinformatics
TIGER: tiled iterative genome assembler
Proceedings
Jian Ma1  Wen-Mei Hwu2  Deming Chen2  Xiao-Long Wu2  Izzat El Hajj2  Yun Heo2 
[1] Department of Bioengineering, University of Illinois at Urbana-Champaign, 61801, Urbana, IL, USA;Institute for Genomic Biology, University of Illinois at Urbana-Champaign, 61801, Urbana, IL, USA;Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, 61801, Urbana, IL, USA;
关键词: Genome Assembly;    Memory Usage;    Target Genome;    Assembly Result;    Contig Length;   
DOI  :  10.1186/1471-2105-13-S19-S18
来源: Springer
PDF
【 摘 要 】

BackgroundWith the cost reduction of the next-generation sequencing (NGS) technologies, genomics has provided us with an unprecedented opportunity to understand fundamental questions in biology and elucidate human diseases. De novo genome assembly is one of the most important steps to reconstruct the sequenced genome. However, most de novo assemblers require enormous amount of computational resource, which is not accessible for most research groups and medical personnel.ResultsWe have developed a novel de novo assembly framework, called Tiger, which adapts to available computing resources by iteratively decomposing the assembly problem into sub-problems. Our method is also flexible to embed different assemblers for various types of target genomes. Using the sequence data from a human chromosome, our results show that Tiger can achieve much better NG50s, better genome coverage, and slightly higher errors, as compared to Velvet and SOAPdenovo, using modest amount of memory that are available in commodity computers today.ConclusionsMost state-of-the-art assemblers that can achieve relatively high assembly quality need excessive amount of computing resource (in particular, memory) that is not available to most researchers to achieve high quality results. Tiger provides the only known viable path to utilize NGS de novo assemblers that require more memory than that is present in available computers. Evaluation results demonstrate the feasibility of getting better quality results with low memory footprint and the scalability of using distributed commodity computers.

【 授权许可】

Unknown   
© Wu et al.; licensee BioMed Central Ltd. 2012. This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

【 预 览 】
附件列表
Files Size Format View
RO202311094419515ZK.pdf 983KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  文献评价指标  
  下载次数:0次 浏览次数:0次