Journal Article Details
Advances in Electrical and Computer Engineering
Information Extraction Using Distant Supervision and Semantic Similarities
PARK, Y.
Keywords: relation extraction; unsupervised learning; distant supervision; information extraction; natural language processing
DOI: 10.4316/AECE.2016.01002
Subject classification: Computer Science (General)
Source: Universitatea "Stefan cel Mare" din Suceava
【 Abstract 】

Information extraction, one of the main research tasks in natural language processing and text mining, extracts useful information from unstructured sentences. Its techniques include named entity recognition, relation extraction, and co-reference resolution. Among these, relation extraction is the task of extracting semantic relations between entities in documents, such as person names and geographic names. It is an important research area with applications in knowledge base construction and question answering systems. This study presents relation extraction using distant supervision, a semi-supervised learning technique that has attracted attention in recent years because it reduces the manual work and cost required for supervised learning. Specifically, the study proposes a method that improves distant supervision in two ways: a clustering method is applied to create the learning corpus, and semantic analysis is applied to relations that are difficult to identify with existing distant supervision. Comparison experiments over various semantic similarity measures are used to identify the similarity calculation methods that are useful for relation extraction with distant supervision, and a large number of accurate relation triples can be extracted using the structural advantages of the proposed method together with semantic similarity comparison.
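
As a rough illustration of the two ideas named in the abstract, the sketch below labels sentences by matching them against knowledge-base triples (the basic distant supervision heuristic) and scores sentence pairs with a similarity function. It is not the authors' method: the toy knowledge base, the example sentences, and the function names `distant_label` and `cosine_similarity` are all hypothetical, and the bag-of-words cosine is only a lexical stand-in for the semantic similarity measures the paper compares.

```python
import math
from collections import Counter

# Hypothetical toy knowledge base of (entity1, relation, entity2) triples.
KB = [
    ("Barack Obama", "born_in", "Honolulu"),
    ("Marie Curie", "born_in", "Warsaw"),
]

# Hypothetical unlabeled sentences.
SENTENCES = [
    "Barack Obama was born in Honolulu in 1961.",
    "Marie Curie grew up in Warsaw before moving to Paris.",
    "Honolulu is the capital of Hawaii.",
]


def distant_label(kb, sentences):
    """Label a sentence with a KB relation when it mentions both entities."""
    labeled = []
    for sentence in sentences:
        for e1, relation, e2 in kb:
            if e1 in sentence and e2 in sentence:
                labeled.append((sentence, e1, relation, e2))
    return labeled


def cosine_similarity(text_a, text_b):
    """Bag-of-words cosine similarity (a lexical stand-in, not semantic)."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


labeled = distant_label(KB, SENTENCES)
for sentence, e1, relation, e2 in labeled:
    print(f"({e1}, {relation}, {e2}) <- {sentence}")
```

The second sentence is labeled `born_in` even though it only says where Marie Curie grew up. This is the kind of noisy match that the entity co-occurrence heuristic produces, and it is one reason a similarity-based check on the labeled sentences can be useful.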

【 License 】

Unknown   
