| Advances in Electrical and Computer Engineering | |
| Information Extraction Using Distant Supervision and Semantic Similarities | |
| PARK, Y1  | |
| Keywords: relation extraction; unsupervised learning; distant supervision; information extraction; natural language processing | |
| DOI : 10.4316/AECE.2016.01002 | |
| Subject classification: Computer Science (General) | |
| Source: Universitatea "Stefan cel Mare" din Suceava | |
【 Abstract 】
Information extraction, one of the main research tasks in natural language processing and text mining, extracts useful information from unstructured sentences. Its techniques include named entity recognition, relation extraction, and co-reference resolution. Relation extraction is the task of extracting semantic relations between entities, such as personal and geographic names, in documents. It is an important research area with applications in knowledge base construction and question answering systems. This study presents relation extraction based on distant supervision, a semi-supervised learning technique that has attracted attention in recent years because it reduces the manual work and cost required for supervised learning. Specifically, the study proposes a method that improves distant supervision in two ways: a clustering method is applied to create the learning corpus, and semantic analysis is used to extract relations that are difficult to identify with existing distant supervision. Comparison experiments over various semantic similarity measures identify the similarity calculation methods that are useful for relation extraction with distant supervision, and a large number of accurate relation triples can be extracted using the structural advantages of the proposed method and semantic similarity comparison.
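To make the core idea concrete, the following is a minimal sketch of distant supervision labeling and a similarity comparison, not the method of the paper. It assumes a tiny hypothetical knowledge base (`KB`) and uses plain substring matching to align triples with sentences. The `cosine` function is a bag-of-words stand-in for the semantic similarity measures mentioned in the abstract, which does not specify them; all names here are illustrative.

```python
from collections import Counter
from math import sqrt

# Hypothetical seed knowledge base of (entity1, relation, entity2) triples.
KB = [
    ("Barack Obama", "born_in", "Honolulu"),
    ("Paris", "capital_of", "France"),
]


def distant_label(sentences, kb):
    """Label each sentence that mentions both entities of a KB triple.

    This is the basic distant supervision assumption: a sentence that
    contains both entities is treated as expressing their known relation.
    """
    labeled = []
    for sentence in sentences:
        for e1, rel, e2 in kb:
            if e1 in sentence and e2 in sentence:
                labeled.append((sentence, e1, rel, e2))
    return labeled


def cosine(a, b):
    """Cosine similarity between bag-of-words vectors of two strings.

    Assumption: a simple lexical measure used as a placeholder for a
    semantic similarity measure.
    """
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[w] * vb[w] for w in va)
    norm = sqrt(sum(v * v for v in va.values())) * sqrt(
        sum(v * v for v in vb.values())
    )
    return dot / norm if norm else 0.0


sentences = [
    "Barack Obama was born in Honolulu in 1961.",
    "Paris is the capital of France.",
    "Honolulu is a city in Hawaii.",
]
corpus = distant_label(sentences, KB)
```

In this sketch, `corpus` holds the two sentences that mention both entities of a triple, each paired with the relation from `KB`. A new sentence could then be compared against the labeled sentences with `cosine` to guess its relation, which is where a stronger semantic similarity measure would be substituted.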
【 License 】
Unknown