期刊论文详细信息
International Journal of Biometric and Bioinformatics
Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT on Row Mean of Spectrogram for Speaker Identification
H. B. Kekre1  Prachi J. Natu1  Shachi J. Natu1  Tanuja Kiran Sarode1 
关键词: Speaker identification;    Speaker Recognition;    Spectrograms;    DCT;    Row Mean;   
DOI  :  
学科分类:计算机科学(综合)
来源: Computer Science Journals
PDF
【 摘 要 】

The goal of this paper is to present a very simple approach to text dependent speaker identification using a combination of spectrograms and well known Discrete Cosine Transform (DCT). This approach is based on use of DCT to find similarities between spectrograms obtained from speech samples. The set of spectrograms forms the database for our experiments rather than raw speech samples. Performance of this approach is compared for different number of coefficients of DCT when DCT is applied on entire spectrogram, when DCT is applied to spectrogram divided into blocks and when DCT is applied to the Row Mean of a spectrogram. Performance comparison shows that, number of mathematical computations required for DCT on Row Mean of spectrogram method is drastically less as compared to other two methods with almost equal identification rate.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201912010254931ZK.pdf 134KB PDF download
  文献评价指标  
  下载次数:13次 浏览次数:17次