Journal article details
Journal of Biomedical Semantics
The 3rd DBCLS BioHackathon: improving life science data integration with Semantic Web technologies
Toshihisa Takagi1  Kiyoshi Asai1  Kosaku Okubo1  Shoko Kawamoto1  Christian M Zmasek1  Chisato Yamasaki1  David Withers1  Rutger A Vos1  James Taylor1  Hideaki Sugawara1  Andrea Splendiani1  Richard Smith1  Yasumasa Shigemoto1  Venkata P Satagopam1  Matthias Samwald1  Taro L Saito1  Pjotr Prins1  Keun-Joon Park1  Kazuki Oshita1  Keiichiro Ono1  Soichi Ogishima1  Tatsuya Nishizawa1  Kunihiro Nishimura1  Kozo Nishida1  Koji Nagao1  Katsuhiko Murakami1  Hideyuki Morita1  Luke McCarthy1  Pierre Lindenbaum1  Yu Lin1  Anna-Lena Lamprecht1  Alberto Labarga1  Kyung-Hoon Kwon1  Fumikazu Konishi1  Akira R Kinjo1  Young Joo Kim1  Nobuhiro Kido1  Hideya Kawaji1  Arek Kasprzyk1  Eli Kaminuma1  Ryosuke Ishiwata1  Heiko Horn1  Kazuhiro Hayashi1  Naohisa Goto1  Paul MK Gordon1  Tore Eriksson1  Peter JA Cock1  Brad Chapman1  Raoul JP Bonnal1  Jerven Bolleman1  Francois Belleau1  Bruno Aranda1  Kazuharu Arakawa1  Erick Antezana1  Hammad Afzal1  Jan Aerts1  Hong-Woo Chun1  Kenta Oouchida1  Shinobu Okamoto1  Yasunori Yamamoto1  Mitsuteru Nakao1  Atsuko Yamaguchi1  Shuichi Kawashima1  Gos Micklem1  Mark D Wilkinson1  Toshiaki Katayama1 
[1]Database Center for Life Science, Research Organization of Information and Systems, 2-11-16, Yayoi, Bunkyo-ku, Tokyo, 113-0032, Japan
Keywords: Interfaces; Web services; Data visualization; Data integration; Databases; Semantic Web; Software; Open source; BioHackathon
Others  :  812403
DOI  :  10.1186/2041-1480-4-6
Received 2012-09-13, accepted 2013-02-05, published 2013
【 Abstract 】

Background

BioHackathon 2010 was the third in a series of meetings hosted by the Database Center for Life Science (DBCLS) in Tokyo, Japan. The overall goal of the BioHackathon series is to improve the quality and accessibility of life science research data on the Web by bringing together representatives from public databases, analytical tool providers, and cyber-infrastructure researchers to jointly tackle important challenges in the area of in silico biological research.

Results

The theme of BioHackathon 2010 was the 'Semantic Web', and all attendees gathered with the shared goal of producing Semantic Web data from their respective resources, and/or consuming or interacting with those data using their tools and interfaces. We discussed topics including guidelines for designing semantic data and the interoperability of resources. We then developed tools and clients for analysis and visualization.

Conclusion

We provide a meeting report from BioHackathon 2010, in which we describe the discussions, decisions, and breakthroughs made as we moved towards compliance with Semantic Web technologies - from source provider, through middleware, to the end-consumer.

【 License 】

   
2013 Katayama et al; licensee BioMed Central Ltd.

【 Figures 】

Figure 1.

Figure 3.
