科技报告详细信息
Confidence Measures in Speech Recognition based on Probability Distribution of
Pinto, Joel ; Sitaram, R.N.V.
HP Development Company
关键词: confidence measures;    speech recognition;    acoustic likelihood;    duration;    viterbi;   
RP-ID  :  HPL-2005-144
学科分类:计算机科学(综合)
美国|英语
来源: HP Labs
PDF
【 摘 要 】

In this paper, we propose two confidence measures (CMs) in speech recognition: one based on acoustic likelihood and the other based on phone duration. For a decoded speech frame aligned to an HMM state, the CM based on acoustic likelihood depends on the relative position of its output likelihood value in the probability distribution of likelihood value in that particular state. The CM of whole phone is the geometric mean of CMs of all frames in it. The CM based on duration depends on the deviation of the observed duration from the expected duration of the recognized phone. The two CMs are combined using weighted geometric mean to obtain a hybrid phone CM. The hybrid CM shows significant improvement over the CM based on time normalized log-likelihood score. On TI-digits database, at 20% false acceptance rate, the normalized acoustic log-likelihood based CM has a detection rate of 83.8% while the hybrid CM has a detection rate of 92.4%. 4 Pages

【 预 览 】
附件列表
Files Size Format View
RO201804100001270LZ 153KB PDF download
  文献评价指标  
  下载次数:15次 浏览次数:47次