期刊论文详细信息
Biodiversity Information Science and Standards
Semantic Annotation of Botanical Collection Data
article
Dominik Röpert1  Fabian Reimeier1  Jörg Holetschek1  Anton Güntsch1 
[1] Botanic Garden and Botanical Museum Berlin
关键词: Linked Open Data;    LOD;    semantic web;    Wikidata;   
DOI  :  10.3897/biss.3.36187
来源: Pensoft
PDF
【 摘 要 】

Herbarium specimens have been digitized at the Botanical Garden and Botanical Museum, Berlin (BGBM) since the year 2000. As part of the digitization process, specimen data have been recorded manually for specific basic data elements. Additional elements were usually added later based on the digital images.During the last twenty years, data were transcribed exactly as they were written on the labels, a widely used procedure in European herbaria. This approach led to a large number of orthographic variations especially with regard to person and place names.To improve interoperability between records within our own collection database and across collection databases provided by the community, we have started to enrich our metadata with Linked Open Data (LOD)-based links to semantic resources starting with collectors and geographic entities. Preferred resources for semantic enrichment (e.g., WikiData, GeoNames) have been agreed on by members of the Consortium of European Taxonomic Facilities (CETAF) in order to exploit the potential of semantically enriched collection data in the best possible way.To be able to annotate many collection records in a relatively short time, priority was given to concepts (e.g., specific collector names) that occur on many specimen labels and that have an existing and easy-to-find semantic representation in an external resource. With this approach, we were able to annotate 52,000 specimen records in just a few weeks of working time of a student assistant.The integration of our semantic annotation workflows with other data integration, cleaning, and import processes at the BGBM  is carried out using an OpenRefine-based platform with specific extensions for services and functions related to label transcription activities (Kirchhoff et al. 2018).Our semantically enriched collection data will contribute to a “Botany Pilot,” which is presently being developed by member organizations of CETAF to demonstrate the potential of Linked Open Collection Data and their integration with existing semantic resources.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO202307130002196ZK.pdf 59KB PDF download
  文献评价指标  
  下载次数:9次 浏览次数:0次