期刊论文详细信息
BMC Genomics
Testing assembly strategies of Francisella tularensis genomes to infer an evolutionary conservation analysis of genomic structures
Robert Maximilian Leidenfrost1  Röbbe Wünschiers1  Knut Reinert2  Kerstin Neubert3  Herbert Tomaso4  Helmut Hotzel4  Eric Zuchantke4  Anne Busch5  Timo Homeier6  Sascha Al Dahouk7  Josephine Grützke7  Burkhard Malorny7  Holger Brendebach7 
[1] Department of Biotechnology and Chemistry, Mittweida University of Applied Sciences, Technikumplatz 17a, 09648, Mittweida, Germany;Department of Mathematics and Computer Science, Algorithmic Bioinformatics, Freie Universität Berlin, Institute of Computer Science, Takustr. 9, 14195, Berlin, Germany;Department of Mathematics and Computer Science, Algorithmic Bioinformatics, Freie Universität Berlin, Institute of Computer Science, Takustr. 9, 14195, Berlin, Germany;German Federal Institute for Risk Assessment, Diedersdorfer Weg 1, 12277, Berlin, Germany;Friedrich-Loeffler-Institut, Institute of Bacterial Infections and Zoonoses, Naumburger Str. 96a, 07749, Jena, Germany;Friedrich-Loeffler-Institut, Institute of Bacterial Infections and Zoonoses, Naumburger Str. 96a, 07749, Jena, Germany;Department of Anaesthesiology and Intensive Care Medicine, University Hospital Jena, Jena, Germany;Friedrich-Loeffler-Institut, Institute of Epidemiology, Südufer, 10 17493, Greifswald, Insel Riems, Germany;German Federal Institute for Risk Assessment, Diedersdorfer Weg 1, 12277, Berlin, Germany;
关键词: Francisella;    Insertion sequences;    High-throughput sequencing;    Short-read assembly;    Hybrid assembly;    Ion Torrent’s ion S5;    Illumina MiSeq;    Illumina HiSeq;    Pacific biosciences RS;    Oxford Nanopore technologies MinION;   
DOI  :  10.1186/s12864-021-08115-x
来源: Springer
PDF
【 摘 要 】

BackgroundWe benchmarked sequencing technology and assembly strategies for short-read, long-read, and hybrid assemblers in respect to correctness, contiguity, and completeness of assemblies in genomes of Francisella tularensis. Benchmarking allowed in-depth analyses of genomic structures of the Francisella pathogenicity islands and insertion sequences. Five major high-throughput sequencing technologies were applied, including next-generation “short-read” and third-generation “long-read” sequencing methods.ResultsWe focused on short-read assemblers, hybrid assemblers, and analysis of the genomic structure with particular emphasis on insertion sequences and the Francisella pathogenicity island. The A5-miseq pipeline performed best for MiSeq data, Mira for Ion Torrent data, and ABySS for HiSeq data from eight short-read assembly methods. Two approaches were applied to benchmark long-read and hybrid assembly strategies: long-read-first assembly followed by correction with short reads (Canu/Pilon, Flye/Pilon) and short-read-first assembly along with scaffolding based on long reads (Unicyler, SPAdes). Hybrid assembly can resolve large repetitive regions best with a “long-read first” approach.ConclusionsGenomic structures of the Francisella pathogenicity islands frequently showed misassembly. Insertion sequences (IS) could be used to perform an evolutionary conservation analysis. A phylogenetic structure of insertion sequences and the evolution within the clades elucidated the clade structure of the highly conservative F. tularensis.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202203047811989ZK.pdf 2609KB PDF download
  文献评价指标  
  下载次数:31次 浏览次数:5次