GigaScience | |
BioMAJ2Galaxy: automatic update of reference data in Galaxy using BioMAJ | |
Olivier Collin2  Fabrice Legeai1  Yvan Le Bras2  Cyril Monjeaud2  Anthony Bretaudeau2  | |
[1] INRIA, IRISA, GenScale, Campus de Beaulieu, Rennes 35042, France;INRIA, IRISA, GenOuest Core Facility, Campus de Beaulieu, Rennes 35042, France | |
关键词: Data libraries; Data manager; Reference data; Galaxy; BioMAJ; | |
Others : 1204329 DOI : 10.1186/s13742-015-0063-8 |
|
received in 2014-12-22, accepted in 2015-04-22, 发布年份 2015 | |
【 摘 要 】
Background
Many bioinformatics tools use reference data, such as genome assemblies or sequence databanks. Galaxy offers multiple ways to give access to this data through its web interface. However, the process of adding new reference data was customarily manual and time consuming, even more so when this data needed to be indexed in a variety of formats (e.g. Blast, Bowtie, BWA, or 2bit).
BioMAJ is a widely used and stable software that is designed to automate the download and transformation of data from various sources. This data can be used directly from the command line, in more complex systems, such as Mobyle, or by using a REST API.
Findings
To ease the process of giving access to reference data in Galaxy, we have developed the BioMAJ2Galaxy module, which enables the gap between BioMAJ and Galaxy to be bridged. With this module, it is now possible to configure BioMAJ to automatically download some reference data, to then convert and/or index it in various formats, and then make this data available in a Galaxy server using data libraries or data managers.
Conclusions
The developments presented in this paper allow us to integrate the reference data in Galaxy in an automatic, reliable, and diskspace-saving way. The code is freely available on the GenOuest GitHub account (https://github.com/genouest/biomaj2galaxy webcite).
【 授权许可】
2015 Bretaudeau et al.; licensee BioMed Central.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
20150524040358375.pdf | 1034KB | download | |
Figure 2. | 99KB | Image | download |
Figure 1. | 42KB | Image | download |
【 图 表 】
Figure 1.
Figure 2.
【 参考文献 】
- [1]Goecks J, Nekrutenko A, Taylor J. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 2010; 11:R86. BioMed Central Full Text
- [2]Blankenberg D, Kuster GV, Coraor N, Ananda G, Lazarus R, Mangan M et al.. Galaxy: a web-based genome analysis tool for experimentalists. Curr Protoc Mol Biol. 2010; Chapter 19(Unit 19.10):1-21.
- [3]Giardine B, Riemer C, Hardison RC, Burhans R, Elnitski L, Shah P et al.. Galaxy: a platform for interactive large-scale genome analysis. Genome Res. 2005; 15:1451-5.
- [4]Goecks J, Eberhard C, Too T, Nekrutenko A, Taylor J. Web-based visual analysis for high-throughput genomics. BMC Genomics. 2013; 14:397. BioMed Central Full Text
- [5]The universal protein resource (UniProt). Nucl Acids Res. 2008; 36:D190-5.
- [6]Benson DA, Cavanaugh M, Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J et al.. Genbank. Nucl Acids Res. 2013; 41:D36-42.
- [7]Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9:357-9.
- [8]Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25:1754-60.
- [9]Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K et al.. Blast+: architecture and applications. BMC Bioinform. 2009; 10:421. BioMed Central Full Text
- [10]Blankenberg D, Johnson JE, Taylor J, Nekrutenko A. Wrangling Galaxy’s reference data. Bioinformatics. 2014; 30:1917-9.
- [11]Pennsylvania State University. Galaxy Main Tool Shed. Accessed 04 17 2015. https://toolshed.g2.bx.psu.edu/.
- [12]Filangi O, Beausse Y, Assi A, Legrand L, Larré J-M, Martin V et al.. BioMaj: a flexible framework for databanks synchronization and processing. Bioinformatics. 2008; 24:1823-5.
- [13]Sloggett C, Goonasekera N, Afgan E. BioBlend: automating pipeline analyses within Galaxy and Cloudman. Bioinformatics. 2013; 29:1685-6.
- [14]Genouest BioInformatics Platform. BioMAJ2Galaxy GitHub Repository. Accessed 04 17 2015. https://github.com/genouest/biomaj2galaxy.
- [15]GenOuest BioInformatics Platform. GUGGO Tool Shed. Accessed 04 17 2015. http://toolshed.genouest.org.
- [16]Biogenouest. GUGGO Web Site. Accessed 04 17 2015. https://www.e-biogenouest.org/groups/guggo.
- [17]Galaxy. Galaxy Contribution #577. Accessed 04 17 2015. https://bitbucket.org/galaxy/galaxy-central/pull-request/577/.
- [18]Galaxy. Galaxy Contribution #601. Accessed 04 17 2015. https://bitbucket.org/galaxy/galaxy-central/pull-request/601/.
- [19]Galaxy Project. BioBlend Contribution #105. Accessed 04 17 2015. https://github.com/afgane/bioblend/pull/105.
- [20]Néron B, Ménager H, Maufrais C, Joly N, Maupetit J, Letort S et al.. Mobyle: a new full web bioinformatics framework. Bioinformatics. 2009; 25:3005-11.
- [21]GenOuest BioInformatics Platform. GenOuest Galaxy Instance. Accessed 04 17 2015. http://galaxy.genouest.org.
- [22]GenOuest BioInformatics Platform. BIPAA Galaxy Instance. Accessed 04 17 2015. http://bipaa-galaxy.genouest.org.
- [23]Bretaudeau A, Monjeaud C, Le Bras Y, Legeai F, Collin O. Software and supporting material for “BioMAJ2Galaxy: automatic update of reference data in Galaxy using BioMAJ”. GigaScience Database. 2015. http://doi.org/10.5524/100138.