期刊论文详细信息
Data Science Journal 卷:18
A Column Styled Composable Schema Matcher for Semantic Data-Types
Zhiming Zhao1  Xiaofeng Liao1  Jordy Bottelier1 
[1] System and Network Engineering Lab, Informatics Institute, University of Amsterdam, Amsterdam;
关键词: Schema Matching;    Semantic Data-types;    XML;    RDF;   
DOI  :  10.5334/dsj-2019-025
来源: DOAJ
【 摘 要 】

Schema matching exists as a long-standing challenge in many database related applications, such as data integration, where two databases with different schema have to be integrated. With the evolvement from database to big data, the schema matching has been enriched with various purposes and application contexts, ranging from data integration, to service integration, to semantic data clouding, until more recent exploratory data analysis over big data. These enriched contexts increase the demand for schema matching between semantic data-types, such as XML, RDF etc. The existing integration approaches have not dealt with the challenges of defining a relation between XML and other semantic data-types. To address these challenges, this paper studies the problem of schema mapping from XML to RDF in two folds. Firstly, testify the validity of single matcher in a column based manner for the semantic data types. Secondly, testify the validity of a highly configurable framework that utilizes hierarchical classification in order to construct a composable pipeline. We propose and implement a Reconfigurable pipeline for Semi-Automatic Schema Matching (REPSASM), which aims to solve the customizability of the matching problem by providing an environment in which a user can create, configure and experiment with their own schema-matching procedure. The experiments performed within this work show that the configurability and hierarchical classification improves the matching result, and it proposes an algorithm to automatically optimize such a hierarchy pipeline.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次