| BMC Bioinformatics | |
| A unified framework for managing provenance information in translational research | |
| Research Article | |
| Todd Minning1  Olivier Bodenreider2  Amit P Sheth3  Priti Parikh3  Vinh Nguyen3  Satya S Sahoo4  | |
| [1] Center for Tropical and Emerging Global Diseases, Univeristy of Georgia, Athens, GA, USA;Lister Hill National Center for Biomedical Communications, National Library of Medicine, NIH, Bethesda, MD, USA;The Kno.e.sis Center, Department of Computer Science and Engineering, Wright State University, Dayton, OH, USA;The Kno.e.sis Center, Department of Computer Science and Engineering, Wright State University, Dayton, OH, USA;Division of Medical Informatics, School of Medicine, Case Western Reserve University, Cleveland, OH, USA; | |
| 关键词: Resource Description Framework; Unify Medical Language System; Query Operator; Query Pattern; Provenance Information; | |
| DOI : 10.1186/1471-2105-12-461 | |
| received in 2011-05-03, accepted in 2011-11-29, 发布年份 2011 | |
| 来源: Springer | |
PDF
|
|
【 摘 要 】
BackgroundA critical aspect of the NIH Translational Research roadmap, which seeks to accelerate the delivery of "bench-side" discoveries to patient's "bedside," is the management of the provenance metadata that keeps track of the origin and history of data resources as they traverse the path from the bench to the bedside and back. A comprehensive provenance framework is essential for researchers to verify the quality of data, reproduce scientific results published in peer-reviewed literature, validate scientific process, and associate trust value with data and results. Traditional approaches to provenance management have focused on only partial sections of the translational research life cycle and they do not incorporate "domain semantics", which is essential to support domain-specific querying and analysis by scientists.ResultsWe identify a common set of challenges in managing provenance information across the pre-publication and post-publication phases of data in the translational research lifecycle. We define the semantic provenance framework (SPF), underpinned by the Provenir upper-level provenance ontology, to address these challenges in the four stages of provenance metadata:(a) Provenance collection - during data generation(b) Provenance representation - to support interoperability, reasoning, and incorporate domain semantics(c) Provenance storage and propagation - to allow efficient storage and seamless propagation of provenance as the data is transferred across applications(d) Provenance query - to support queries with increasing complexity over large data size and also support knowledge discovery applicationsWe apply the SPF to two exemplar translational research projects, namely the Semantic Problem Solving Environment for Trypanosoma cruzi (T.cruzi SPSE) and the Biomedical Knowledge Repository (BKR) project, to demonstrate its effectiveness.ConclusionsThe SPF provides a unified framework to effectively manage provenance of translational research data during pre and post-publication phases. This framework is underpinned by an upper-level provenance ontology called Provenir that is extended to create domain-specific provenance ontologies to facilitate provenance interoperability, seamless propagation of provenance, automated querying, and analysis.
【 授权许可】
CC BY
© Sahoo et al; licensee BioMed Central Ltd. 2011
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO202311109855571ZK.pdf | 4545KB |
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]
- [35]
- [36]
- [37]
- [38]
- [39]
- [40]
- [41]
- [42]
- [43]
- [44]
- [45]
- [46]
- [47]
- [48]
- [49]
- [50]
- [51]
- [52]
- [53]
- [54]
- [55]
PDF