会议论文详细信息
2nd Annual International Conference on Information System and Artificial Intelligence
Word2vec and dictionary based approach for uyghur text filtering
物理学;计算机科学
Tohti, Turdi^1 ; Zhao, Yunxing^1 ; Musajan, Winira^1
School of Information Science and Engineering, Xinjiang University, Faculty of Computer, Science and Technology Building, No.666, Road, Shengli Urumqi
830046, China^1
关键词: Dictionary-based;    Filtering accuracies;    Similar pattern;    Text filtering;    Text information;    Uyghur-chinese;    Word vectors;    Wu-manber algorithms;   
Others  :  https://iopscience.iop.org/article/10.1088/1742-6596/887/1/012013/pdf
DOI  :  10.1088/1742-6596/887/1/012013
学科分类:计算机科学(综合)
来源: IOP
PDF
【 摘 要 】

With emerging of deep learning, the expression of words in computer has made major breakthroughs and the effect of text processing based on word vector has also been significantly improved. This paper maps all patterns into a more abstract vector space by Uyghur-Chinese dictionary and deep learning tool Word2vec, at first. Secondly, a similar pattern is found according the characteristics of the original pattern. Finally, texts are filtered using Wu-Manber algorithm. Experiments show that this method can get obvious filtering accuracy and recall of Uyghur text information improved.

【 预 览 】
附件列表
Files Size Format View
Word2vec and dictionary based approach for uyghur text filtering 627KB PDF download
  文献评价指标  
  下载次数:59次 浏览次数:32次