Journal Article Details
Applied Sciences
Open Domain Chinese Triples Hierarchical Extraction Method
Yanli Hu [1], Chunhui He [1], Haoran Wang [1], Zhen Tan [1], Chong Zhang [1], Bin Ge [1]
[1] Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410073, China
Keywords: named entity recognition; open relation prediction; information extraction; CTHE
DOI: 10.3390/app10144819
Source: DOAJ
【Abstract】

Open domain relation prediction is an important task in triple extraction. When constructing large-scale knowledge graph systems, structured data alone is not enough: triples must also be extracted automatically from large amounts of unstructured text to expand entities and relations. Although many English open relation prediction methods have achieved good performance, a high-performance system for open domain Chinese triple extraction has yet to be developed, owing to the lack of large-scale annotated Chinese corpora and the difficulty of Chinese language processing. In this paper, we propose an integrated open domain Chinese triples hierarchical extraction method (CTHE) to address this problem, combining the advantages of the Bi-LSTM-CRF and Att-Bi-GRU models on top of the pre-trained BERT encoding model. The method recognizes named entities in Chinese sentences to establish entity pairs, and performs hierarchical extraction of specific and open relations based on a user-defined schema library and an attention mechanism. Experimental results demonstrate the effectiveness of the method, which achieved stable performance on the test dataset and better precision and F1-score than state-of-the-art Chinese open domain triples extraction methods. Furthermore, a large-scale annotated dataset for the Chinese named entity recognition (NER) task is established to support research on Chinese NER.
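
The abstract describes the pipeline only at a high level: recognize entities, form entity pairs, then extract specific relations from a schema library and open relations otherwise. The sketch below shows one plausible reading of that control flow, and it is not the authors' implementation. The function `extract_triples`, the schema layout keyed by entity-type pairs, the `threshold` parameter, and the two callables standing in for the specific-relation and open-relation models are all assumptions made for illustration; in the paper those roles are filled by trained neural models whose interfaces the abstract does not give.

```python
from itertools import permutations
from typing import Callable, Dict, List, Optional, Tuple

Entity = Tuple[str, str]       # (surface text, entity type), as an NER step might emit
Triple = Tuple[str, str, str]  # (head, relation, tail)


def extract_triples(
    entities: List[Entity],
    schema: Dict[Tuple[str, str], List[str]],
    specific_scorer: Callable[[str, str, str], float],
    open_predictor: Callable[[str, str], Optional[str]],
    threshold: float = 0.5,
) -> List[Triple]:
    """Hypothetical two-level extraction: schema relations first, open relations second."""
    triples: List[Triple] = []
    # Every ordered pair of distinct entities is a candidate (head, tail).
    for (head, head_type), (tail, tail_type) in permutations(entities, 2):
        # Level 1: relations the user-defined schema allows for this type pair.
        candidates = schema.get((head_type, tail_type), [])
        best = max(
            candidates,
            key=lambda rel: specific_scorer(head, rel, tail),
            default=None,
        )
        if best is not None and specific_scorer(head, best, tail) >= threshold:
            triples.append((head, best, tail))
            continue
        # Level 2: fall back to an open relation predictor (may return None).
        relation = open_predictor(head, tail)
        if relation is not None:
            triples.append((head, relation, tail))
    return triples


# Toy stand-ins for the trained models; every value here is made up.
demo_entities = [("马云", "PER"), ("阿里巴巴", "ORG"), ("杭州", "LOC")]
demo_schema = {("PER", "ORG"): ["founder_of"]}


def demo_scorer(head: str, rel: str, tail: str) -> float:
    return 0.9 if (head, rel, tail) == ("马云", "founder_of", "阿里巴巴") else 0.0


def demo_open(head: str, tail: str) -> Optional[str]:
    return "located_in" if (head, tail) == ("阿里巴巴", "杭州") else None


result = extract_triples(demo_entities, demo_schema, demo_scorer, demo_open)
```

Checking the schema before the open predictor keeps user-defined relations authoritative and leaves the open model to cover only the pairs the schema does not explain. Whether CTHE orders the two levels in exactly this way is not stated in the abstract.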

【License】

Unknown   
