Journal Article Details
Sensors
Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction
Darko Brodić [1], Dragan R. Milivojević [2]
[1] Technical Faculty Bor, V.J. 12, University of Belgrade, 19210 Bor, Serbia
[2] Department of Informatics, Zeleni Bulevar 35, Mining and Metallurgy Institute, 19210 Bor, Serbia
Keywords: OCR; document engineering; text line segmentation; text features; testing
DOI: 10.3390/s100505263
Source: MDPI
【 Abstract 】

Text line segmentation is an essential stage in off-line optical character recognition (OCR) systems. It is key because inaccurately segmented text lines lead to OCR failure. Text line segmentation of handwritten documents is a complex and diverse problem, complicated by the nature of handwriting, and is therefore a leading challenge in handwritten document image processing. Because the quality of text segmentation algorithms is measured and evaluated inconsistently, a basic set of measurement methods is required. Currently, no such set is commonly accepted, and every algorithm evaluation is custom-oriented. In this paper, a basic test framework for the evaluation of text feature extraction algorithms is proposed. The framework consists of a few experiments primarily linked to text line segmentation, skew rate and reference text line evaluation. Although the experiments are mutually independent, the results obtained are strongly cross-linked. The main advantages of the framework are its suitability for different types of letters and languages and its adaptability. Thus, the paper presents an efficient evaluation method for text analysis algorithms.

【 License 】

CC BY   
© 2010 by the authors; licensee MDPI, Basel, Switzerland.
