IAES International Conference on Electrical Engineering, Computer Science and Informatics | |
Automatic Text Summarization for Indonesian Language Using TextTeaser | |
电工学;计算机科学 | |
Gunawan, D.^1 ; Pasaribu, A.^1 ; Rahmat, R.F.^1 ; Budiarto, R.^2 | |
Department of Information Technology, Faculty of Computer Science and Information Technology, University of Sumatera Utara, Medan, Indonesia^1 | |
Department of Information System, College of Computer Science and Information Technology, Albaha University, Saudi Arabia^2 | |
关键词: Automatic text summarization; Indonesian languages; Information overloads; Language independents; Sentence length; Text summarization; | |
Others : https://iopscience.iop.org/article/10.1088/1757-899X/190/1/012048/pdf DOI : 10.1088/1757-899X/190/1/012048 |
|
学科分类:计算机科学(综合) | |
来源: IOP | |
【 摘 要 】
Text summarization is one of the solution for information overload. Reducing text without losing the meaning not only can save time to read, but also maintain the reader's understanding. One of many algorithms to summarize text is TextTeaser. Originally, this algorithm is intended to be used for text in English. However, due to TextTeaser algorithm does not consider the meaning of the text, we implement this algorithm for text in Indonesian language. This algorithm calculates four elements, such as title feature, sentence length, sentence position and keyword frequency. We utilize TextRank, an unsupervised and language independent text summarization algorithm, to evaluate the summarized text yielded by TextTeaser. The result shows that the TextTeaser algorithm needs more improvement to obtain better accuracy.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
Automatic Text Summarization for Indonesian Language Using TextTeaser | 763KB | download |