Biodiversity Information Science and Standards
From Raw Data to Data Standards through Quality Assessment and Semantic Annotation
article
Julien Sananikone [1], Elie Arnaud [1], Olivier Norvez [2], Sophie Pamerlon [3], Anne-Sophie Archambeau [4], Yvan Le Bras [1]
[1] MNHN;FRB;OFB - Office Français de la Biodiversité;IRD
Keywords: Ecological Metadata Language; EML; FAIR; FAIR assessment; terminological resources; ontologies; thesaurus
DOI: 10.3897/biss.6.91205
Source: Pensoft
【 Abstract 】

Data quality and documentation are at the core of the FAIR (Findable, Accessible, Interoperable, Reusable) principles (Wilkinson et al. 2016). In biodiversity, and more broadly in ecology, solutions that complement the well-known data-standard approach (notably Darwin Core (Wieczorek et al. 2012)) are emerging from intensive use of the EML (Ecological Metadata Language (Michener et al. 1997)) metadata standard. These solutions notably capitalize on (1) semantic annotation from EML metadata documents that describe data attributes, and (2) FAIR quality assessment as proposed by the DataOne (Data Observation Network for Earth) network.

Here we present this point of view by orchestrating the production of rich EML metadata (with attribute descriptions and links to terms from terminological resources) from raw data files and, from that metadata, the generation of FAIR metrics for direct assessment of FAIRness and the creation of data standards like Darwin Core. Using EML, we can describe each data attribute (e.g., name, type, unit) and associate each attribute with one or several terms from terminological resources. Using the Darwin Core vocabulary as a terminological resource, we can thus associate, in the metadata file, the original attribute names with the corresponding Darwin Core terms. The data and their metadata files can then be processed to automatically create the files needed for a Darwin Core Archive. By acting at the metadata level, with the raw data files kept accessible alongside, we can map raw attribute names to standardized ones and then, potentially, create data standards.
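The attribute-to-term mapping described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the XML fragment is a simplified, hypothetical excerpt modelled on EML attribute annotations (an `annotation` element holding a `propertyURI` and a `valueURI`), the property URI under `example.org`, the French column names, and the function names `dwc_mapping` and `to_darwin_core` are all assumptions made for illustration. The sketch only renames annotated columns to Darwin Core terms; a real Darwin Core Archive also needs a descriptor file and its own metadata document, which are not produced here.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Namespace of the Darwin Core terms vocabulary.
DWC_NS = "http://rs.tdwg.org/dwc/terms/"

# Simplified, hypothetical EML-like fragment (not a complete EML document).
# The property URI is a placeholder; only the valueURI is used for mapping.
EML_FRAGMENT = """
<attributeList>
  <attribute id="att1">
    <attributeName>espece</attributeName>
    <annotation>
      <propertyURI label="maps to">http://example.org/property/mapsTo</propertyURI>
      <valueURI label="scientificName">http://rs.tdwg.org/dwc/terms/scientificName</valueURI>
    </annotation>
  </attribute>
  <attribute id="att2">
    <attributeName>date_obs</attributeName>
    <annotation>
      <propertyURI label="maps to">http://example.org/property/mapsTo</propertyURI>
      <valueURI label="eventDate">http://rs.tdwg.org/dwc/terms/eventDate</valueURI>
    </annotation>
  </attribute>
  <attribute id="att3">
    <attributeName>note_interne</attributeName>
  </attribute>
</attributeList>
"""

# Hypothetical raw data file whose columns match the attribute names above.
RAW_CSV = "espece,date_obs,note_interne\nParus major,2021-05-03,ok\n"


def dwc_mapping(eml_xml: str) -> dict[str, str]:
    """Map raw attribute names to Darwin Core term names via annotations."""
    root = ET.fromstring(eml_xml)
    mapping: dict[str, str] = {}
    for attribute in root.iter("attribute"):
        name = (attribute.findtext("attributeName") or "").strip()
        for value_uri in attribute.iterfind("annotation/valueURI"):
            uri = (value_uri.text or "").strip()
            # Keep the first annotation that points into the Darwin Core namespace.
            if name and uri.startswith(DWC_NS):
                mapping[name] = uri[len(DWC_NS):]
                break
    return mapping


def to_darwin_core(raw_csv: str, mapping: dict[str, str]) -> str:
    """Rename annotated columns to Darwin Core terms; drop unannotated ones."""
    reader = csv.DictReader(io.StringIO(raw_csv))
    kept = [col for col in (reader.fieldnames or []) if col in mapping]
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow([mapping[col] for col in kept])
    for row in reader:
        writer.writerow([row[col] for col in kept])
    return out.getvalue()


if __name__ == "__main__":
    print(to_darwin_core(RAW_CSV, dwc_mapping(EML_FRAGMENT)))
```

Dropping unannotated columns is a choice made for this sketch; a real workflow might instead report them so that the metadata can be completed.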

【 License 】

Unknown   
