Journal of Computer Science | |
Comparison of Speech Features on the Speech Recognition Task | Science Publications | |
Todor Ganchev1  Nikos Fakotakis1  Iosif Mporas1  Mihalis Siafarikas1  | |
关键词: Speech parameterization; speech recognition; wavelet packets; | |
DOI : 10.3844/jcssp.2007.608.616 | |
学科分类:计算机科学(综合) | |
来源: Science Publications | |
【 摘 要 】
In the present work we overview some recently proposed discrete Fourier transform (DFT)- and discrete wavelet packet transform (DWPT)-based speech parameterization methods and evaluate their performance on the speech recognition task. Specifically, in order to assess the practical value of these less studied speech parameterization methods, we evaluate them in a common experimental setup and compare their performance against traditional techniques, such as the Mel-frequency cepstral coefficients (MFCC) and perceptual linear predictive (PLP) cepstral coefficients which presently dominate the speech recognition field. In particular, utilizing the well established TIMIT speech corpus and employing the Sphinx-III speech recognizer, we present comparative results of 8 different speech parameterization techniques.
【 授权许可】
Unknown
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201911300281817ZK.pdf | 362KB | download |