期刊论文

【摘要】

Nowadays, there is a huge amount of Historical Arabic Documents (HAD) in the national libraries and archives around the world. Analyzing this type of data manually is a difficult and costly task. Thus, an automatic process is required to exploit these documents more rapidly. Processing historical documents is a recent research subject that has seen a remarkable growth in the last years. Processing Historical Arabic Documents is a particularly challenging problem. First, due to complicated nature of Arabic script compared to other scripts and second because the documents are ancient. This paper focuses on this difficult problem and provides a comprehensive survey of existing research work. First, we describe in detail the challenges making the automatic processing of Historical Arabic Documents a difficult task. Second, we classify this task into four applications of automatic processing of HAD: i) Analyze the document to extract the main text ii) Identify the writer of the document iii) Recognize some words or parts of the document in a reference dataset and iv) Retrieve and extract specific data from the document. For each application, existing approaches are surveyed and qualitatively described. Finally, we focus on available datasets and describe how they can be used in each application. (C) 2019 Elsevier Ltd. All rights reserved.

【授权许可】

Free

【预览】

附件列表
Files	Size	Format	View
10_1016_j_patcog_2019_107144.pdf	5047KB	PDF	download

PATTERN RECOGNITION	卷:100
Automatic processing of Historical Arabic Documents: A comprehensive Survey
Article
Ibn Khedher, Mohamed¹ Jmila, Houda² El-Yacoubi, Mounim A.²
[1] IRT SystemX, 8 Ave Vauve, F-91120 Palaiseau, France
[2] Inst Polytech Paris, CNRS, Telecom SudParis, Samovar, 9 Rue Charles Fourier, F-91011 Evry, France
关键词: Historical Arabic Documents; Writer identification; Data retrieval; Text analysis; Text recognition; Survey on Historical Arabic Documents;
DOI : 10.1016/j.patcog.2019.107144
来源: Elsevier
PDF


	文献评价指标
	下载次数：8次	浏览次数：1次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】