BMC Bioinformatics | |
Data hosting infrastructure for primary biodiversity data | |
Research | |
Phil Cryer1  Grant Yamashita2  Anthony Goddard3  Nathan Wilson4  | |
[1] Center for Biodiversity Informatics (CBI), Missouri Botanical Garden, 63119, St Louis, MO, USA;Center for Biology and Society, Arizona State University, 85287, Tempe, AZ, USA;Center for Library and Informatics, Woods Hole Marine Biological Laboratory, 02543, Woods Hole, MA, USA;Encyclopedia of Life, Center for Library and Informatics, Woods Hole Marine Biological Laboratory, 02543, Woods Hole, MA, USA; | |
关键词: Optical Character Recognition; Biodiversity Data; Global Biodiversity Information Facility; Data Preservation; Open Archival Information System; | |
DOI : 10.1186/1471-2105-12-S15-S5 | |
来源: Springer | |
【 摘 要 】
BackgroundToday, an unprecedented volume of primary biodiversity data are being generated worldwide, yet significant amounts of these data have been and will continue to be lost after the conclusion of the projects tasked with collecting them. To get the most value out of these data it is imperative to seek a solution whereby these data are rescued, archived and made available to the biodiversity community. To this end, the biodiversity informatics community requires investment in processes and infrastructure to mitigate data loss and provide solutions for long-term hosting and sharing of biodiversity data.DiscussionWe review the current state of biodiversity data hosting and investigate the technological and sociological barriers to proper data management. We further explore the rescuing and re-hosting of legacy data, the state of existing toolsets and propose a future direction for the development of new discovery tools. We also explore the role of data standards and licensing in the context of data hosting and preservation. We provide five recommendations for the biodiversity community that will foster better data preservation and access: (1) encourage the community's use of data standards, (2) promote the public domain licensing of data, (3) establish a community of those involved in data hosting and archival, (4) establish hosting centers for biodiversity data, and (5) develop tools for data discovery.ConclusionThe community's adoption of standards and development of tools to enable data discovery is essential to sustainable data preservation. Furthermore, the increased adoption of open content licensing, the establishment of data hosting infrastructure and the creation of a data hosting and archiving community are all necessary steps towards the community ensuring that data archival policies become standardized.
【 授权许可】
CC BY
© Goddard et al; licensee BioMed Central Ltd. 2011
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311092193613ZK.pdf | 443KB | download |
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]
- [35]
- [36]
- [37]
- [38]
- [39]
- [40]
- [41]
- [42]
- [43]
- [44]
- [45]
- [46]
- [47]
- [48]
- [49]
- [50]
- [51]
- [52]
- [53]
- [54]
- [55]
- [56]
- [57]
- [58]
- [59]
- [60]
- [61]
- [62]
- [63]
- [64]
- [65]
- [66]
- [67]
- [68]
- [69]
- [70]
- [71]
- [72]
- [73]
- [74]
- [75]
- [76]
- [77]
- [78]
- [79]
- [80]
- [81]
- [82]
- [83]
- [84]
- [85]
- [86]
- [87]
- [88]
- [89]
- [90]
- [91]
- [92]
- [93]
- [94]
- [95]
- [96]
- [97]
- [98]
- [99]
- [100]
- [101]
- [102]
- [103]
- [104]
- [105]
- [106]
- [107]
- [108]
- [109]
- [110]
- [111]
- [112]
- [113]
- [114]
- [115]
- [116]
- [117]
- [118]
- [119]
- [120]
- [121]