Journal Article Details
Journal of Big Data
Arabic text summarization using deep learning approach
Said Desouki [1], Molham Al-Maleh [2]
[1] Faculty of Informatics and Communication Engineering, Arab International University, Damascus, Syria
[2] Faculty of Information Technology, Higher Institute for Applied Sciences and Technology, Damascus, Syria
Keywords: Natural language processing; Text summarization; Deep learning; Big data; Sequence-to-sequence framework
DOI: 10.1186/s40537-020-00386-7
Source: Springer
【 Abstract 】

Natural language processing has made remarkable progress with the advent of deep learning techniques. Text summarization, along with other tasks such as text translation and sentiment analysis, has used deep neural network models to improve results. Recent text summarization methods follow the sequence-to-sequence framework, an encoder–decoder model composed of neural networks trained jointly on input and output. Deep neural networks take advantage of large datasets to improve their results. These networks are supported by the attention mechanism, which handles long texts more efficiently by identifying focus points in the text, and by the copy mechanism, which allows the model to copy words directly from the source into the summary. In this research, we re-implement the basic summarization model that applies the sequence-to-sequence framework to the Arabic language, to which this model has not previously been applied for text summarization. First, we build an Arabic dataset of summarized article headlines. This dataset consists of approximately 300 thousand entries, each comprising an article introduction and its corresponding headline. We then apply baseline summarization models to this dataset and compare the results using the ROUGE metric.
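
To make the evaluation step concrete, the following is a minimal sketch of how a ROUGE-N score can be computed between a reference headline and a generated one. It is an illustration only and not the authors' evaluation code: the function names `ngrams` and `rouge_n` are hypothetical, tokenization is assumed to be a plain whitespace split (Arabic text would normally need normalization and a proper tokenizer first), and the example sentences are invented.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(reference, candidate, n=1):
    """Compute ROUGE-N recall, precision, and F1.

    Assumption: both strings are tokenized by whitespace only.
    """
    ref = ngrams(reference.split(), n)
    cand = ngrams(candidate.split(), n)
    # Counter intersection keeps the minimum count per n-gram,
    # which gives the clipped number of matching n-grams.
    overlap = sum((ref & cand).values())
    ref_total = sum(ref.values())
    cand_total = sum(cand.values())
    recall = overlap / ref_total if ref_total else 0.0
    precision = overlap / cand_total if cand_total else 0.0
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return {"recall": recall, "precision": precision, "f1": f1}


# Invented example: reference headline vs. a model-generated headline.
reference = "the cat sat on the mat"
candidate = "the cat lay on the mat"
rouge1 = rouge_n(reference, candidate, n=1)
rouge2 = rouge_n(reference, candidate, n=2)
```

Here ROUGE-1 counts shared unigrams and ROUGE-2 counts shared bigrams. Published results are usually produced with an established ROUGE implementation, so a sketch like this is mainly useful for understanding what the reported numbers measure.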

【 License 】

CC BY
