学位论文详细信息
LinkWiper – A System For Data Quality in Linked Open Data
Data quality;RDF;Linked open data;Crowdsourcing;Dereferenced links;Computer Science;Computer and Information Science, College of Engineering and Computer Science
Gade, SrivalliZhu, Qiang ;
University of Michigan
关键词: Data quality;    RDF;    Linked open data;    Crowdsourcing;    Dereferenced links;    Computer Science;    Computer and Information Science, College of Engineering and Computer Science;   
Others  :  https://deepblue.lib.umich.edu/bitstream/handle/2027.42/136065/LinkWiper%20%e2%80%93%20A%20System%20For%20Data%20Quality%20in%20Linked%20Open%20Data.pdf?sequence=1&isAllowed=y
瑞士|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】
Linked Open Data (LOD) provides access to large amounts of data on Web. These data setsrange from high quality curated data sets to low quality sets. LOD sources often need strategies to clean up data and provide methodology for quality assessment in linked data. They allow interlinking and integrating any kind of data on the web. Links between various data sources enable software applications to operate over the aggregated data space as if it is a unique local database.However, such links may be broken, leading to data quality problems. In this thesis wepresent LinkWiper, an automated system for cleaning data in LOD. While this thesis focuses on problems related to dereferenced links, LinkWiper can be used to tackle any other data quality problem such as duplication and consistency. The proposed system includes two major phases.The first phase uses information retrieval-like search techniques to recommend sets of alternative links. The second phase adopts crowdsourcing mechanisms to involve workers (or users) in improving the quality of the LOD sources. We provide an implementation of LinkWiper over DBPedia, a community effort to extract structured information from Wikipedia and make this information using LOD principles. We also conduct extensive experiments to illustrate the efficiency and high precision of the proposed approach.
【 预 览 】
附件列表
Files Size Format View
LinkWiper – A System For Data Quality in Linked Open Data 1903KB PDF download
  文献评价指标  
  下载次数:12次 浏览次数:26次