期刊论文详细信息
PeerJ
The health care and life sciences community profile for dataset descriptions
article
Michel Dumontier1  Alasdair J.G. Gray2  M. Scott Marshall3  Vladimir Alexiev4  Peter Ansell5  Gary Bader6  Joachim Baran1  Jerven T. Bolleman7  Alison Callahan1  José Cruz-Toledo8  Pascale Gaudet9  Erich A. Gombocz1,10  Alejandra N. Gonzalez-Beltran1,11  Paul Groth1,12  Melissa Haendel1,13  Maori Ito1,14  Simon Jupp1,15  Nick Juty1,15  Toshiaki Katayama1,16  Norio Kobayashi1,17  Kalpana Krishnaswami1,18  Camille Laibe1,15  Nicolas Le Novère1,19  Simon Lin2,20  James Malone1,15  Michael Miller2,21  Christopher J. Mungall2,22  Laurens Rietveld2,23  Sarala M. Wimalaratne1,15  Atsuko Yamaguchi1,16 
[1] Stanford Center for Biomedical Informatics Research, Stanford University;Department of Computer Science, Heriot-Watt University;Department of Radiation Oncology ,(MAASTRO), GROW— School for Oncology and Developmental Biology, MAASTRO Clinic;Ontotext Corporation;CSIRO;The Donnelly Centre, University of Toronto;Swiss-Prot group, SIB Swiss Institute of Bioinformatics;Carleton University;CALIPHO group, SIB Swiss Institute of Bioinformatics;IO Informatics;Oxford e-Research Centre, University of Oxford;Elsevier Labs;Department of Medical Informatics and Epidemiology, Oregon Health Sciences University;Office of Medical Informatics and Epidemiology;EMBL, European Bioinformatics Institute;Database Center for Life Science;Advanced Center for Computing and Communication;Cerenode Inc.;The Babraham Institute;Nationwide Children’s Hospital;Institute for Systems Biology;Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory;Department of Exact Sciences, VU University Amsterdam
关键词: Data profiling;    Dataset descriptions;    Metadata;    Provenance;    FAIR data;   
DOI  :  10.7717/peerj.2331
学科分类:社会科学、人文和艺术(综合)
来源: Inra
PDF
【 摘 要 】

Access to consistent, high-quality metadata is critical to finding, understanding, and reusing scientific data. However, while there are many relevant vocabularies for the annotation of a dataset, none sufficiently captures all the necessary metadata. This prevents uniform indexing and querying of dataset repositories. Towards providing a practical guide for producing a high quality description of biomedical datasets, the W3C Semantic Web for Health Care and the Life Sciences Interest Group (HCLSIG) identified Resource Description Framework (RDF) vocabularies that could be used to specify common metadata elements and their value sets. The resulting guideline covers elements of description, identification, attribution, versioning, provenance, and content summarization. This guideline reuses existing vocabularies, and is intended to meet key functional requirements including indexing, discovery, exchange, query, and retrieval of datasets, thereby enabling the publication of FAIR data. The resulting metadata profile is generic and could be used by other domains with an interest in providing machine readable descriptions of versioned datasets.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202307100014978ZK.pdf 641KB PDF download
  文献评价指标  
  下载次数:12次 浏览次数:2次