International Journal of Biometric and Bioinformatics | |
Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT on Row Mean of Spectrogram for Speaker Identification | |
H. B. Kekre1  Prachi J. Natu1  Shachi J. Natu1  Tanuja Kiran Sarode1  | |
关键词: Speaker identification; Speaker Recognition; Spectrograms; DCT; Row Mean; | |
DOI : | |
学科分类:计算机科学(综合) | |
来源: Computer Science Journals | |
【 摘 要 】
The goal of this paper is to present a very simple approach to text dependent speaker identification using a combination of spectrograms and well known Discrete Cosine Transform (DCT). This approach is based on use of DCT to find similarities between spectrograms obtained from speech samples. The set of spectrograms forms the database for our experiments rather than raw speech samples. Performance of this approach is compared for different number of coefficients of DCT when DCT is applied on entire spectrogram, when DCT is applied to spectrogram divided into blocks and when DCT is applied to the Row Mean of a spectrogram. Performance comparison shows that, number of mathematical computations required for DCT on Row Mean of spectrogram method is drastically less as compared to other two methods with almost equal identification rate.
【 授权许可】
Unknown
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201912010254931ZK.pdf | 134KB | download |