期刊论文

【摘要】

Motivated by the inherent correlation between the speech features and their lexical words, we propose in this paper a new framework for learning the parameters of the corresponding acoustic and language models jointly. The proposed framework is based on discriminative training of the models' parameters using minimum classification error criterion. To verify the effectiveness of the proposed framework, a set of four large decoding graphs is constructed using weighted finite-state transducers as a composition of two sets of context-dependent acoustic models and two sets of n-gram-based language models. The experimental results conducted on this set of decoding graphs validated the effectiveness of the proposed framework when compared with four baseline systems based on maximum likelihood estimation and separate discriminative training of acoustic and language models in benchmark testing of two speech corpora, namely TIMIT and RM1.

【授权许可】

Unknown

【预览】

附件列表
Files	Size	Format	View
RO201912020426427ZK.pdf	1126KB	PDF	download

APSIPA Transactions on Signal and Information Processing
Joint optimization on decoding graphs using minimum classification error criterion

Abdelaziz A. Abdelhamid¹ Waleed H. Abdulla²
[1] Ain Shams University;Auckland University
关键词: Speech recognition; Weighted finite-state transducers; Discriminative training; Acoustic models; Language models;
DOI : 10.1017/ATSIP.2014.5
学科分类：计算机科学（综合）
来源: Cambridge University Press
PDF


	文献评价指标
	下载次数：14次	浏览次数：15次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】