期刊论文详细信息
Biodiversity Information Science and Standards
Practical use of aggregator data quality metrics in a collection scenario
article
Andrew Bentley1 
[1] University of Kansas
关键词: Aggregators;    GBIF;    iDigBio;    metrics;    data quality;    collections;    IPT;   
DOI  :  10.3897/biss.2.25970
来源: Pensoft
PDF
【 摘 要 】

The recent incorporation of standardized data quality metrics into the GBIF, iDigBio, and ALA portal infrastructures enables data providers with useful information they can use to clean or augment Darwin Core data at the source based on these recommendations. Numerous taxonomic and geographic based metrics provide useful information on the quality of various Darwin Core fields in this realm, while also providing input on Darwin Core compliance for others. As a provider/data manager for the Biodiversity Institute, University of Kansas, and having spent some time evaluating their efficacy and reliability, this presentation will highlight some of the positive and negative aspects of my experience with specific examples while highlighting concerns regarding the user experience and standardization of these metrics across the aggregator landscape. These metrics have indicated both data and publishing issues that have increased the utility and cleanliness of our data while also highlighting batch processing challenges and issues with the process of inferring "bad" data. The integration of these metrics into source database infrastructure will also be postulated, with Specify Software as an example.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO202307130002388ZK.pdf 40KB PDF download
  文献评价指标  
  下载次数:9次 浏览次数:0次