科技报告详细信息
Steps Toward Improved Integration, Search, and Analysis of Heterogeneous Data in the Astrobiology Habitable Environments Database
Keller, Richard M ; Detweiler, Angela M ; Lafuente Valverde, Barbara ; Blake, David F ; Bristow, Thomas F ; Cooper, George W ; Dateo, Christopher E ; Des Marais, David J ; Jahnke, Linda L ; Kubo, Michael D(Search for Extraterrestrial Intelligence Inst, Moffett Field, CA, United States)
关键词: DATA BASES;    METADATA;    EXOBIOLOGY;    HABITABILITY;    EXTRATERRESTRIAL ENVIRONMENTS;    STANDARDIZATION;    DATA SYSTEMS;    DATA MANAGEMENT;    HETEROGENEITY;   
RP-ID  :  ARC-E-DAA-TN55827
美国|英语
来源: NASA Technical Reports Server
PDF
【 摘 要 】

The Astrobiology Habitable Environments Database (AHED) is a new data system being developed as a long-term, open-access repository for astrobiology data. AHED is intended to store user-contributed results from NASA or externally-funded research in astrobiology, and to encourage sharing and synergy within the astrobiology community. However, the interdisciplinary nature of astrobiology presents some specific challenges to data management, integration, and analysis within AHED. In some disciplines (e.g., genomics), open databases thrive because the contributed products are fairly uniform and standardized (e.g., sequence data). In astrobiology, each investigation produces a unique set of data products; this makes it difficult to search across different datasets to find similar data, or to combine results from separate investigations. With AHED, we are taking steps to ensure there is adequate metadata - both at the dataset and record levels - to facilitate search, integration, and analysis. At the dataset level, we are developing a new metadata standard for describing astrobiology datasets, with detailed information about content, funding source, and scientific relevance, along with a set of topical keywords for characterizing datasets. At the record level, we are encouraging users to provide more structured content and finer-grained metadata. In many user-contributed science data repositories, few restrictions are placed on the uploaded data format, and minimal or no record-level metadata is required; thus users are unburdened when it comes to data preparation. The tradeoff is that deep integration and search across datasets is almost impossible without standardized structures and metadata. Although AHED users are free to upload minimally-described datasets, they will be encouraged to use database authoring tools (supplied by the underlying platform - Open Data Repository's Data Publisher) plus a set of customizable astrobiology-specific templates to help structure their data and provide standardized metadata. In reward for their extra effort, AHED will be able to deliver enhanced search, discovery, and analysis capabilities.

【 预 览 】
附件列表
Files Size Format View
20180006682.pdf 81KB PDF download
  文献评价指标  
  下载次数:10次 浏览次数:13次