期刊论文详细信息
Journal of Computer Science
ARABIC PERSON NAMES RECOGNITION BY USING A RULE BASED APPROACH | Science Publications
Mohd Juzaiddin Ab Aziz1  Mohammed Aboaoga1 
关键词: Named Entity;    Rule-Based Approach;    Arabic Morphological Analyzer;    Named Entity Recognition;   
DOI  :  10.3844/jcssp.2013.922.927
学科分类:计算机科学(综合)
来源: Science Publications
PDF
【 摘 要 】

Name Entity Recognition is very important task in many natural language processing applications such as; Machine Translation, Question Answering, Information Extraction, Text Summarization, Semantic Applications and Word Sense Disambiguation. Rule-based approach is one of the techniques that are used for named entity recognition to identify the named entities such as a person names, location names and organization names. The recent rule-based methods have been applied to recognize the person names in political domain. They ignored the recognition of other named entity types such as locations and organizations. We have used the rule based approach for recognizing the named entity type (person names) for Arabic. We have developed four rules for identifying the person names depending on the position of name. We have used an in-house Arabic corpus collected from newspaper achieves. The evaluation method that compares the results of the system with the manually annotated text has been applied in order to compute precision, recall and f-measure. In the experiment of this study, the average f-measure for recognizing person names are (92.66, 92.04 and 90.43%) in sport, economic and politic domain respectively. The experimental results showed that our rule-based method achieved the highest f-measure values in sport domain comparing with political and economic domains.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201911300009207ZK.pdf 148KB PDF download
  文献评价指标  
  下载次数:4次 浏览次数:11次