期刊论文

【摘要】

Problem statement: Researchers on Arabic speaker recognition have used local data bases unavailable to the public. In this study we would like to investigate Arabic speaker recognition using a publically available database, namely Babylon Levantine available from the Linguistic Data Consortium (LDC).Approach: Among the different methods for speaker recognition we focus on Hidden Markov Models (HMM). We studied the effect of both the parameters of the HMM models and the size of the speech features on the recognition rate. Results: To accomplish this study, we divided the database into small and medium size datasets. For each subset, we found the effect of the system parameters on the recognition rate. The parameters we varied the number of HMM states, the number of Gaussian mixtures per state, and the number of speech features coefficients. From the results, we found that in general, the recognition rate increases with the increase in the number of mixtures, till it reaches a saturation level which depends on the data size and the number of HMM states. Conclusion/Recommendations: The effect of the number of state depends on the data size. For small data, low number of states has higher recognition rate. For larger data, the number of states has very small effect at low number of mixtures and negligible effect at high number of mixtures.

【授权许可】

Unknown

【预览】

附件列表
Files	Size	Format	View
RO201911300790984ZK.pdf	350KB	PDF	download

Journal of Computer Science
Arabic Speaker Recognition: Babylon Levantine Subset Case Study \| Science Publications

Mansour Alsulaiman¹ Mohamed A. Bencherif¹ Youssef Alotaibi¹ Awais Mahmoud¹ Muhammad Ghulam¹
关键词: HMM; GMM; MFCC; Arabic speaker; Babylon; Levantine;
DOI : 10.3844/jcssp.2010.381.385
学科分类：计算机科学（综合）
来源: Science Publications
PDF


	文献评价指标
	下载次数：4次	浏览次数：14次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】