Technical Report Details
Decision Combination in Speech Metadata Extraction
Lin, Xiaofan
HP Development Company
Keywords: speech recognition; metadata extraction; decision combination; multi-layer perceptron; gender classification
RP-ID: HPL-2003-12R1
Subject classification: Computer Science (General)
United States | English
Source: HP Labs
【 Abstract 】

Speech metadata extraction can both improve speech recognition and enable novel Interactive Voice Response applications. Unlike previous research, which concentrates on frame-level signal processing and pattern classification, this paper systematically studies the behavior of decision combination at the utterance level. We analyze its asymptotic characteristics and the factors affecting frame-level classification. In addition, we introduce new methods that combine frame-level decisions more accurately and efficiently, including phoneme/power-based weighting and smart sampling. Experimental results in gender classification are presented.

Notes: Copyright IEEE. To be published in and presented at the 37th Asilomar Conference on Signals, Systems and Computers, 9-12 November 2003, Pacific Grove, CA. 5 pages.
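
As a rough illustration of utterance-level decision combination with power-based weighting, the Python sketch below merges per-frame class posteriors into a single utterance decision. The abstract does not give the report's actual combination rule or weighting formula, so everything here is an assumption made for illustration: the function name `combine_frame_decisions`, the use of a weighted mean of log-posteriors (a product-rule style combination), and the choice of raw frame power as the weight.

```python
import math


def combine_frame_decisions(frame_posteriors, frame_powers=None):
    """Combine per-frame class posteriors into one utterance-level decision.

    frame_posteriors: list of dicts, one per frame, mapping label -> posterior.
    frame_powers: optional list of non-negative frame energies used as weights
        (an assumed weighting scheme; uniform weights are used if omitted).
    Returns (best_label, scores), where scores holds the weighted mean
    log-posterior of each label over the utterance.
    """
    if not frame_posteriors:
        raise ValueError("need at least one frame")
    if frame_powers is None:
        frame_powers = [1.0] * len(frame_posteriors)
    if len(frame_powers) != len(frame_posteriors):
        raise ValueError("one power value is required per frame")
    total = sum(frame_powers)
    if total <= 0:
        raise ValueError("frame powers must sum to a positive value")

    eps = 1e-12  # floor that keeps log() finite for zero posteriors
    scores = {}
    for label in frame_posteriors[0]:
        weighted = sum(
            w * math.log(max(p[label], eps))
            for w, p in zip(frame_powers, frame_posteriors)
        )
        scores[label] = weighted / total
    best_label = max(scores, key=scores.get)
    return best_label, scores


# Hypothetical frame-level outputs of a gender classifier for one utterance.
frames = [
    {"male": 0.6, "female": 0.4},
    {"male": 0.4, "female": 0.6},
    {"male": 0.4, "female": 0.6},
]
uniform_label, _ = combine_frame_decisions(frames)
# Made-up powers: the first frame is much louder than the other two.
weighted_label, _ = combine_frame_decisions(frames, frame_powers=[5.0, 1.0, 1.0])
```

With these made-up numbers the uniform and power-weighted combinations disagree, which shows how much the choice of frame weights can matter for the utterance-level result.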
