| BMC Bioinformatics | |
| Bison: bisulfite alignment on nodes of a cluster | |
| Devon Patrick Ryan1  Dan Ehninger1  | |
| [1] German Center for Neurodegenerative Diseases (DZNE), Ludwig-Erhard-Allee 2, Bonn 53175, Germany | |
| 关键词: Computer cluster; Bisulfite sequencing; Alignment; DNA Methylation; | |
| Others : 1085457 DOI : 10.1186/1471-2105-15-337 |
|
| received in 2014-03-18, accepted in 2014-09-25, 发布年份 2014 | |
PDF
|
|
【 摘 要 】
Background
DNA methylation changes are associated with a wide array of biological processes. Bisulfite conversion of DNA followed by high-throughput sequencing is increasingly being used to assess genome-wide methylation at single-base resolution. The relative slowness of most commonly used aligners for processing such data introduces an unnecessarily long delay between receipt of raw data and statistical analysis. While this process can be sped-up by using computer clusters, current tools are not designed with them in mind and end-users must create such implementations themselves.
Results
Here, we present a novel BS-seq aligner, Bison, which exploits multiple nodes of a computer cluster to speed up this process and also has increased accuracy. Bison is accompanied by a variety of helper programs and scripts to ease, as much as possible, the process of quality control and preparing results for statistical analysis by a variety of popular R packages. Bison is also accompanied by bison_herd, a variant of Bison with the same output but that can scale to a semi-arbitrary number of nodes, with concomitant increased demands on the underlying message passing interface implementation.
Conclusions
Bison is a new bisulfite-converted short-read aligner providing end users easier scalability for performance gains, more accurate alignments, and a convenient pathway for quality controlling alignments and converting methylation calls into a form appropriate for statistical analysis. Bison and the more scalable bison_herd are natively able to utilize multiple nodes of a computer cluster simultaneously and serve to simplify to the process of creating analysis pipelines.
【 授权许可】
2014 Ryan and Ehninger; licensee BioMed Central Ltd.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 20150113173547168.pdf | 1580KB | ||
| Figure 7. | 117KB | Image | |
| Figure 6. | 59KB | Image | |
| Figure 5. | 59KB | Image | |
| Figure 4. | 72KB | Image | |
| Figure 3. | 24KB | Image | |
| Figure 2. | 33KB | Image | |
| Figure 1. | 46KB | Image |
【 图 表 】
Figure 1.
Figure 2.
Figure 3.
Figure 4.
Figure 5.
Figure 6.
Figure 7.
【 参考文献 】
- [1]Jones PA: Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet 2012, 13:484-492.
- [2]Krueger F, Andrews SR: Bismark: a flexible aligner and methylation caller for bisulfite-Seq applications. Bioinformatics 2011, 27:1571-1572.
- [3]Krueger F, Kreck B, Franke A, Andrews SR: DNA methylome analysis using short bisulfite sequencing data. Nat Methods 2012, 9:145-151.
- [4]Kunde-Ramamoorthy G, Coarfa C, Laritsky E, Kessler NJ, Harris RA, Xu M, Chen R, Shen L, Milosavljevic A, Waterland RA: Comparison and quantitative verification of mapping algorithms for whole-genome bisulfite sequencing. Nucleic Acids Res 2014, 42:e43.
- [5]Hansen KD, Langmead B, Irizarry RA: BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions. Genome Biol 2012, 13:R83. BioMed Central Full Text
- [6]Guo W, Fiziev P, Yan W, Cokus S, Sun X, Zhang MQ, Chen PY, Pellegrini M: BS-Seeker2: a versatile aligning pipeline for bisulfite sequencing data. BMC Genomics 2013, 14:774. BioMed Central Full Text
- [7]Langmead B, Salzberg SL: Fast gapped-read alignment with Bowtie 2. Nat Methods 2012, 9:357-359.
- [8]Xi Y, Li W: BSMAP: whole genome bisulfite sequence MAPping program. BMC Bioinformatics 2009, 10:232. BioMed Central Full Text
- [9]Hong C, Clement NL, Clement S, Hammoud SS, Carrell DT, Cairns BR, Snell Q, Clement MJ, Johnson WE: Probabilistic alignment leads to improved accuracy and read coverage for bisulfite sequencing data. BMC Bioinformatics 2013, 14:337. BioMed Central Full Text
- [10]Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The sequence alignment/Map format and SAMtools. Bioinformatics 2009, 25:2078-2079.
- [11]Akalin A, Kormaksson M, Li S, Garrett-Bakelman FE, Figueroa ME, Melnick A, Mason CE: methylKit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles. Genome Biol 2012, 13:R87. BioMed Central Full Text
- [12]Hebestreit K, Dugas M, Klein HU: Detection of significantly differentially methylated regions in targeted bisulfite sequencing data. Bioinformatics 2013, 29:1647-1653.
- [13]Akman K, Haaf T, Gravina S, Vijg J, Tresch A: Genomewide, quantitative analysis of DNA methylation from bisulfite sequencing data. Bioinformatics 2014, 30:1933-1934.
- [14]Burger L, Gaidatzis D, Schubeler D, Stadler MB: Identification of active regulatory regions from DNA methylation data. Nucleic Acids Res 2013, 41:e155.
- [15]Seisenberger S, Andrews S, Krueger F, Arand J, Walter J, Santos F, Popp C, Thienpont B, Dean W, Reik W: The dynamics of genome-wide DNA methylation reprogramming in mouse primordial germ cells. Mol Cell 2012, 48:849-862.
- [16]Molaro A, Hodges E, Fang F, Song Q, McCombie WR, Hannon GJ, Smith AD: Sperm methylation profiles reveal features of epigenetic inheritance and evolution in primates. Cell 2011, 146:1029-1041.
- [17]Smallwood SA, Tomizawa S, Krueger F, Ruf N, Carli N, Segonds-Pichon A, Sato S, Hata K, Andrews SR, Kelsey G: Dynamic CpG island methylation landscape in oocytes and preimplantation embryos. Nat Genet 2011, 43:811-814.
PDF