期刊论文详细信息
IEEE Access 卷:9
Deep Learning Strategy for Braille Character Recognition
Adeeba Kausar1  Muhammad Wasif2  Sajjad Manzoor3  Tasleem Kausar3  M. Adnan Ashraf3  Yun Lu4 
[1] Department of Computer Science and Information Technology, University of Narowal, Narowal, Punjab, Pakistan;
[2] Department of Electrical Engineering, University of Gujrat, Gujrat, Punjab, Pakistan;
[3] Mirpur Institute of Technology, Mirpur University of Science and Technology, Mirpur, Azad Jammu and Kashmir, Pakistan;
[4] School of Computer Science and Engineering, Huizhou University, Huizhou, China;
关键词: Braille images;    image alignment;    principal component analysis;    Wiener filtering;    convolution neural networks;    inverted residual block;   
DOI  :  10.1109/ACCESS.2021.3138240
来源: DOAJ
【 摘 要 】

People with vision impairment use Braille language for reading, writing, and communication. The basic structure of the Braille language consists of six dots arranged in three rows and two column cells, which are identified by visually impaired people using finger touch. However, it is difficult to memorize the pattern of dots that form the Braille characters. This research presents a novel approach for automatic Braille characters recognition. The designed approach works in two main stages. In first stage, image alignment & enhancement are performed using several image preprocessing techniques. In second stage, character recognition is performed with a proposed lightweight convolution neural network (CNN). As CNN shows promise for accurate recognition of optical characters. Therefore, we adopted several recently proposed state-of-the-art CNN networks for Braille characters’ recognition. To make the networks light and improve their recognition performance, we proposed a strategy by replacing few modules in the original CNNs with an inverted residual block (IRB) module with less computational cost. The novelty of this work lies in CNN model design and output performance. We executed the effectiveness of the designed setup through experiments on two different publicly available benchmark Braille datasets obtained from visually impaired people. On the English Braille and Chinese double-sided Braille image (DSBI) datasets, the proposed model shows a prediction accuracy of 95.2% and 98.3%, respectively. The reported test time of model is about 0.01s for English and 0.03s DSBI Braille images. In comparison to state-of-the-art, designed method is robust, effective, and capable to identify the Braille characters efficiently. In future, the functional performance of the proposed Braille recognition scheme will be tested through accessible user interfaces.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:2次