会议论文详细信息
IAES International Conference on Electrical Engineering, Computer Science and Informatics
Automatic Text Summarization for Indonesian Language Using TextTeaser
电工学;计算机科学
Gunawan, D.^1 ; Pasaribu, A.^1 ; Rahmat, R.F.^1 ; Budiarto, R.^2
Department of Information Technology, Faculty of Computer Science and Information Technology, University of Sumatera Utara, Medan, Indonesia^1
Department of Information System, College of Computer Science and Information Technology, Albaha University, Saudi Arabia^2
关键词: Automatic text summarization;    Indonesian languages;    Information overloads;    Language independents;    Sentence length;    Text summarization;   
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/190/1/012048/pdf
DOI  :  10.1088/1757-899X/190/1/012048
学科分类:计算机科学(综合)
来源: IOP
PDF
【 摘 要 】

Text summarization is one of the solution for information overload. Reducing text without losing the meaning not only can save time to read, but also maintain the reader's understanding. One of many algorithms to summarize text is TextTeaser. Originally, this algorithm is intended to be used for text in English. However, due to TextTeaser algorithm does not consider the meaning of the text, we implement this algorithm for text in Indonesian language. This algorithm calculates four elements, such as title feature, sentence length, sentence position and keyword frequency. We utilize TextRank, an unsupervised and language independent text summarization algorithm, to evaluate the summarized text yielded by TextTeaser. The result shows that the TextTeaser algorithm needs more improvement to obtain better accuracy.

【 预 览 】
附件列表
Files Size Format View
Automatic Text Summarization for Indonesian Language Using TextTeaser 763KB PDF download
  文献评价指标  
  下载次数:21次 浏览次数:32次