会议论文详细信息
Semantic Web and Information Extraction
Unsupervised Improvement of Named Entity Extractionin Short Informal Context Using Disambiguation Clues
Mena B. Habib and Maurice van Keulen
Others  :  http://ceur-ws.org/Vol-925/paper_1.pdf
PID  :  27557
来源: CEUR
PDF
【 摘 要 】

Short context messages (like tweets and SMS’s) are a potentially rich source of continuously and instantly updated information. Shortness and informality of such messages are challenges for Natural Language Processing tasks.Most efforts done in this direction rely on machine learning techniques which are expensive in terms of data collection and training.In this paper we present an unsupervised Semantic Web-driven approach to improve the extraction process by using clues from the disambiguation process. For extraction we used a simple Knowledge-Base matching technique combined with a clustering-based approach for disambiguation. Experimental results on a self-collected set of tweets (as an example of short context messages) show improvement in extraction results when using unsupervised feedback from the disambiguation process.

【 预 览 】
附件列表
Files Size Format View
Unsupervised Improvement of Named Entity Extractionin Short Informal Context Using Disambiguation Clues 565KB PDF download
  文献评价指标  
  下载次数:6次 浏览次数:1次