期刊论文详细信息
BMC Bioinformatics
Mapsembler, targeted and micro assembly of large NGS datasets on a desktop computer
Research Article
Rayan Chikhi1  Pierre Peterlongo2 
[1] ENS Cachan/IRISA, EPI Symbiose, Rennes, France;INRIA Rennes - Bretagne Atlantique, EPI Symbiose, Rennes, France;
关键词: Reference Genome;    Sequence Fragment;    Read Position;    Extension Graph;    Read File;   
DOI  :  10.1186/1471-2105-13-48
 received in 2011-09-02, accepted in 2012-02-24,  发布年份 2012
来源: Springer
PDF
【 摘 要 】

BackgroundThe analysis of next-generation sequencing data from large genomes is a timely research topic. Sequencers are producing billions of short sequence fragments from newly sequenced organisms. Computational methods for reconstructing whole genomes/transcriptomes (de novo assemblers) are typically employed to process such data. However, these methods require large memory resources and computation time. Many basic biological questions could be answered targeting specific information in the reads, thus avoiding complete assembly.ResultsWe present Mapsembler, an iterative micro and targeted assembler which processes large datasets of reads on commodity hardware. Mapsembler checks for the presence of given regions of interest that can be constructed from reads and builds a short assembly around it, either as a plain sequence or as a graph, showing contextual structure. We introduce new algorithms to retrieve approximate occurrences of a sequence from reads and construct an extension graph. Among other results presented in this paper, Mapsembler enabled to retrieve previously described human breast cancer candidate fusion genes, and to detect new ones not previously known.ConclusionsMapsembler is the first software that enables de novo discovery around a region of interest of repeats, SNPs, exon skipping, gene fusion, as well as other structural events, directly from raw sequencing reads. As indexing is localized, the memory footprint of Mapsembler is negligible. Mapsembler is released under the CeCILL license and can be freely downloaded fromhttp://alcovna.genouest.org/mapsembler/.

【 授权许可】

CC BY   
© Peterlongo and Chikhi; licensee BioMed Central Ltd. 2012

【 预 览 】
附件列表
Files Size Format View
RO202311104272267ZK.pdf 2156KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  文献评价指标  
  下载次数:6次 浏览次数:0次