Journal of Imaging | |
Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text | |
Mohamed Hamada1  Abdelrahman Abdallah2  Daniyar Nurseitov3  | |
[1] Department of Information System, International IT University, 050000 Almaty, Kazakhstan;Department of Machine Learning & Data Science, Satbayev University, 050013 Almaty , Kazakhstan;National Open Research Laboratory for Information and Space Technologies, Satbayev University, 050013 Almaty, Kazakhstan; | |
关键词: handwriting recognition; fully gated convolutional neural networks; bidirectional gated recurrent unit; deep learning; | |
DOI : 10.3390/jimaging6120141 | |
来源: DOAJ |
【 摘 要 】
This article considers the task of handwritten text recognition using attention-based encoder–decoder networks trained in the Kazakh and Russian languages. We have developed a novel deep neural network model based on a fully gated CNN, supported by multiple bidirectional gated recurrent unit (BGRU) and attention mechanisms to manipulate sophisticated features that achieve 0.045 Character Error Rate (CER), 0.192 Word Error Rate (WER), and 0.253 Sequence Error Rate (SER) for the first test dataset and 0.064 CER, 0.24 WER and 0.361 SER for the second test dataset. Our proposed model is the first work to handle handwriting recognition models in Kazakh and Russian languages. Our results confirm the importance of our proposed Attention-Gated-CNN-BGRU approach for training handwriting text recognition and indicate that it can lead to statistically significant improvements (p-value < 0.05) in the sensitivity (recall) over the tests dataset. The proposed method’s performance was evaluated using handwritten text databases of three languages: English, Russian, and Kazakh. It demonstrates better results on the Handwritten Kazakh and Russian (HKR) dataset than the other well-known models.
【 授权许可】
Unknown