学位论文

【摘要】

This thesis focuses on modifying the open source speech recognition toolkit,Kaldi, to work for the task of handwriting recognition, also called text recognition.Various methods were explored to improve the performance of thetext recognition setup. Text recognition refers to the automatic transcriptionof handwritten or printed text inputs from sources such as text pageimages, personal digital assistants, electronic white-boards or other devices.Text recognition can be performed in both online and off-line scenarios. Offlinerecognition involves recognition of handwritten images whereas on-linerecognition also stores the time trajectory information of each stroke.Handwriting recognition has long been an active area of research and usesmany of the same models used to perform automatic speech recognition (ASR).One such model used in both tasks is the Hidden Markov Model (HMM).In handwriting recognition, the text line images are treated as observationsgenerated by underlying states representing the transcription. In this thesis,a hybrid deep-neural-network-HMM (DNN-HMM) acoustic model used forASR was adapted for text recognition. To overcome a major challenge ofout of vocabulary (OOV) words, a new subword based algorithm was implementedfor lexicon and language modeling. Different data augmentation and language specific modifications such as character decomposition, andbidirectional reordering were studied. To improve the performance of our textrecognition setup, shared models, semi-supervised training and a recurrentneural network language modeling were also used. We investigated the performanceof the text recognition setup on different languages, as well as whentrained on varying amounts of data of different resolution and background.We report competitive results on several commonly used handwritten andprinted text datasets.

【预览】

附件列表
Files	Size	Format	View
Printed text and handwriting recognition	3215KB	PDF	download


Printed text and handwriting recognition
OCR;Kaldi;Electrical Engineering
Arora, AshishKhudanpur, Sanjeev ;
Johns Hopkins University
关键词: OCR; Kaldi; Electrical Engineering;
Others : https://jscholarship.library.jhu.edu/bitstream/handle/1774.2/60140/ARORA-THESIS-2018.pdf?sequence=1&isAllowed=y
瑞士\|英语
来源: JOHNS HOPKINS DSpace Repository
PDF


	文献评价指标
	下载次数：25次	浏览次数：19次

【 摘 要 】

【 预 览 】

【摘要】

【预览】