Journal Article Details
BMC Genomics
Genome reassembly with high-throughput sequencing data
Proceedings
Nathaniel Parrish [1], Eleazar Eskin [1], Benjamin Sudakov [2]
[1] Department of Computer Science, University of California Los Angeles, Los Angeles, California, USA; [2] Department of Mathematics, University of California Los Angeles, Los Angeles, California, USA
Keywords: Reference Genome; Graph Construction; Reference Marker; Donor Genome; Euler Tour
DOI: 10.1186/1471-2164-14-S1-S8
Source: Springer
【 Abstract 】

Motivation: Recent studies in genomics have highlighted the significance of structural variation in determining individual variation. Current methods for identifying structural variation, however, are predominantly focused on either assembling whole genomes from scratch, or identifying the relatively small changes between a genome and a reference sequence. While significant progress has been made in recent years on both de novo assembly and resequencing (read mapping) methods, few attempts have been made to bridge the gap between them.

Results: In this paper, we present a computational method for incorporating a reference sequence into an assembly algorithm. We propose a novel graph construction that builds upon the well-known de Bruijn graph to incorporate the reference, and describe a simple algorithm, based on iterative message passing, which uses this information to significantly improve assembly results. We validate our method by applying it to a series of 5 Mb simulation genomes derived from both mammalian and bacterial references. The results of applying our method to this simulation data are presented along with a discussion of the benefits and drawbacks of this technique.
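For readers unfamiliar with the de Bruijn graph that the abstract builds on, the following is a minimal sketch of the standard construction from reads, plus a simple check of which graph edges also occur in a reference sequence. It is not the authors' graph construction or their message-passing algorithm, neither of which is specified in this abstract. The function names `de_bruijn_edges` and `edges_shared_with_reference`, the toy reads, and the choice of k are assumptions made purely for illustration.

```python
from collections import defaultdict


def de_bruijn_edges(sequences, k):
    """Build a standard de Bruijn graph from sequences.

    Nodes are (k-1)-mers; each k-mer contributes one edge from its
    prefix to its suffix. The value stored per edge is the number of
    times that k-mer was observed.
    """
    graph = defaultdict(lambda: defaultdict(int))
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            graph[kmer[:-1]][kmer[1:]] += 1
    return graph


def edges_shared_with_reference(graph, reference, k):
    """Hypothetical helper (not from the paper): return the set of
    graph edges whose k-mer also appears in the reference sequence."""
    shared = set()
    for i in range(len(reference) - k + 1):
        kmer = reference[i:i + k]
        u, v = kmer[:-1], kmer[1:]
        # Membership tests do not create new entries in a defaultdict.
        if u in graph and v in graph[u]:
            shared.add((u, v))
    return shared


if __name__ == "__main__":
    reads = ["ACGTAC", "GTACGG"]   # toy donor reads (illustrative)
    reference = "ACGTTC"           # toy reference (illustrative)
    g = de_bruijn_edges(reads, k=3)
    for u in sorted(g):
        for v in sorted(g[u]):
            print(u, "->", v, "count", g[u][v])
    print(sorted(edges_shared_with_reference(g, reference, k=3)))
```

In this sketch the reference is only used to flag overlapping edges; how reference information is actually encoded in the graph and propagated during assembly is described in the full paper.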

【 License 】

© Parrish et al.; licensee BioMed Central Ltd. 2013. This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
