会议论文详细信息
2018 2nd International Conference on Artificial Intelligence Applications and Technologies
Improving Named Entity Recognition using Bilingual Constraints and Word Alignment
计算机科学
Dao, An T.^1 ; Truong, Thinh H.^1 ; Nguyen, Long^1 ; Dinh, Dien^1
Faculty of Information Technology, University of Science, Hochiminh city, Viet Nam^1
关键词: Bilingual resources;    Chinese named entity recognition;    Machine translations;    Named entity recognition;    Question Answering;    State-of-the-art system;    Supervised machine learning;    Time-consuming tasks;   
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/435/1/012007/pdf
DOI  :  10.1088/1757-899X/435/1/012007
学科分类:计算机科学(综合)
来源: IOP
PDF
【 摘 要 】
Named entities carry essential meanings and information in natural language. Therefore, Named Entity Recognition has many applications in different Natural Language Processing tasks such as Information Retrieval, Information Extraction, Machine Translation, and Question Answering. State-of-the-art Named Entity Recognition systems are based on supervised machine learning algorithms which require huge amounts of training data. The main problem, however, is that constructing named entity annotated corpora is an expensive, labor-intensive, and time-consuming task. Therefore, in this paper, we propose an approach to improve monolingual Named Entity Recognition systems by exploiting an existing unannotated English-Chinese bilingual corpus. The system jointly recognizes named entities in both English and Chinese sentences through the use of bilingual constraints. Experimental results show an improvement in Named Entity Recognition of both Chinese and English compared to the strong baseline StanfordNER. In particular, Chinese Named Entity Recognition improves significantly by 20.81% in term of F 1-score. As for the English language, Named Entity Recognition F 1-score increases slightly from 75.75% to 76.08%. When comparing to the state-of-the-art system in improving Named Entity Recognition based on bilingual resources, we manage to outperform in Chinese Named Entity Recognition task by 5.99% and achieve comparable results for the English side. Our proposed method can also be generalized to apply to resource-Limited languages.
【 预 览 】
附件列表
Files Size Format View
Improving Named Entity Recognition using Bilingual Constraints and Word Alignment 196KB PDF download
  文献评价指标  
  下载次数:3次 浏览次数:25次