Conference Paper Details
Joint Conference on Green Engineering Technology & Applied Computing 2019
The Comparison of Semantic Suffix Tree Clustering and Suffix Tree Clustering Algorithm Influence on the Accuracy Rate of an Indonesian Question Answering System
Industrial Technology (General); Computer Science
Isnurthina, Dininta^1 ; Ihsan Jambak, Muhammad^1 ; Yusliani, Novi^1
Faculty of Computer Science, Sriwijaya University, Palembang, Indonesia^1
Keywords: Accuracy rate; Clustering documents; Comparison result; Document Clustering; Question answering systems; Question categories; Research analysis; String matching
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/551/1/012042/pdf
DOI  :  10.1088/1757-899X/551/1/012042
Source: IOP
【 Abstract 】

This research compares the Semantic Suffix Tree Clustering (SSTC) algorithm with the Suffix Tree Clustering (STC) algorithm for clustering documents in an Indonesian Question Answering System. The fundamental difference between the two algorithms is that SSTC considers the meaning of words when generating clusters and readable labels, rather than relying only on string matching as STC does. As such, SSTC is able to return more specific and accurate clusters, and has the potential to increase the accuracy rate of a question answering system. However, contrary to this hypothesis, the comparison results show that the accuracy rate of the Indonesian Question Answering System is lower when the documents are clustered by SSTC than when they are clustered by STC. The accuracy rate degraded in almost every question category, with the exception of the Definition category. On average, the Indonesian Question Answering System obtains an accuracy rate of only 23.31% with SSTC, whereas it obtains an accuracy rate of 83% with STC. This significant difference indicates that the Semantic Suffix Tree Clustering algorithm is not suitable for document clustering in the Indonesian Question Answering System.
