科技报告详细信息
Semantic Classification
Krithara, Anastasia
HP Development Company
关键词: semantic web;    machine learning;    document classification;   
RP-ID  :  HPL-2004-182
学科分类:计算机科学(综合)
美国|英语
来源: HP Labs
PDF
【 摘 要 】

A key challenge in the semantic web is the mapping between different concepts. Many techniques for such mapping exist, but most of them induce a one-to-one mapping, which does not seem to correspond to real world problems. This project proposes a new approach, which tries to use the power of machine learning, and in particular classification algorithms, to solve the mapping task. It introduces a new semantic similarity metric which is used with semantic metadata and classification algorithms. The approach is tested in a real world dataset. Pre-processing of the dataset took place, and in particular feature selection, extraction and representation was implemented, for both content- based and semantic features. The documents of the dataset were classified using the content-based features, the semantic ones, and their combination. The results were compared and they gave us an insight of how semantic features can affect classifiers and traditional features. Notes: Anastasia Krithara, University of Bristol, Bristol, UK 70 Pages

【 预 览 】
附件列表
Files Size Format View
RO201804100000879LZ 4975KB PDF download
  文献评价指标  
  下载次数:14次 浏览次数:32次