科技报告详细信息
GeneLab Analysis Working Group Kick-Off Meeting
Costes, Sylvain V
关键词: METADATA;    DATA RETRIEVAL;    DATA SYSTEMS;    GENE EXPRESSION;    PHYSIOLOGY;    HETEROGENEITY;    PROTEOME;    BIOMEDICAL DATA;    DATA MANAGEMENT;    LOGISTICS;    INFORMATION RETRIEVAL;   
RP-ID  :  ARC-E-DAA-TN52427
学科分类:生物科学(综合)
美国|英语
来源: NASA Technical Reports Server
PDF
【 摘 要 】

Goals to achieve for GeneLab AWG - GL vision - Review of GeneLab AWG charter Timeline and milestones for 2018 Logistics - Monthly Meeting - Workshop - Internship - ASGSR Introduction of team leads and goals of each group Introduction of all members Q/A Three-tier Client Strategy to Democratize Data Physiological changes, pathway enrichment, differential expression, normalization, processing metadata, reproducibility, Data federation/integration with heterogeneous bioinformatics external databases The GLDS currently serves over 100 omics investigations to the biomedical community via open access. In order to expand the scope of metadata record searches via the GLDS, we designed a metadata warehouse that collects and updates metadata records from external systems housing similar data. To demonstrate the capabilities of federated search and retrieval of these data, we imported metadata records from three open-access data systems into the GLDS metadata warehouse: NCBI's Gene Expression Omnibus (GEO), EBI's PRoteomics IDEntifications (PRIDE) repository, and the Metagenomics Analysis server (MG-RAST). Each of these systems defines metadata for omics data sets differently. One solution to bridge such differences is to employ a common object model (COM) to which each systems' representation of metadata can be mapped. Warehoused metadata records are then transformed at ETL to this single, common representation. Queries generated via the GLDS are then executed against the warehouse, and matching records are shown in the COM representation (Fig. 1). While this approach is relatively straightforward to implement, the volume of the data in the omics domain presents challenges in dealing with latency and currency of records. Furthermore, the lack of a coordinated has been federated data search for and retrieval of these kinds of data across other open-access systems, so that users are able to conduct biological meta-investigations using data from a variety of sources. Such meta-investigations are key to corroborating findings from many kinds of assays and translating them into systems biology knowledge and, eventually, therapeutics.

【 预 览 】
附件列表
Files Size Format View
20180001227.pdf 1294KB PDF download
  文献评价指标  
  下载次数:14次 浏览次数:18次