期刊论文

【摘要】

In this paper, a voice activity detector is proposed on the basis of Gaussian modeling of noise in the spectro-temporal space. Spectro-temporal space is obtained from auditory cortical processing. The auditory model that offers a multi-dimensional picture of the sound includes two stages: the initial stage is a model of inner ear and the second stage is the auditory central cortical modeling in the brain. In this paper, the speech noise in this picture has been modeled by a 3-D mono Gaussian cluster. At the start of suggested VAD process, the noise is modeled by a Gaussian shaped cluster. The average noise behavior is obtained in different spectrotemporal space in various points for each frame. In the stage of separation of speech from noise, the criterion is the difference between the average noise behavior and the speech signal amplitude in spectrotemporal domain. This was measured for each frame and was used as the criterion of classification. Using Noisex92, this method is tested in different noise models such as White, exhibition, Street, Office and Train noises. The results are compared to both auditory model and multifeature method. It is observed that the performance of this method in low signal-to-noise ratios (SNRs) conditions is better than other current methods.

【授权许可】

Unknown

【预览】

附件列表
Files	Size	Format	View
RO201912040511362ZK.pdf	337KB	PDF	download

Signal Processing: An International Journal
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Using Spectro-Temporal Domain

Farbod Razzazi¹ Azim Fard¹ Sara Valipour¹
[1] $$
关键词: Voice activity detector; Spectro-temporal Domain; Gaussian modeling; Auditory model;
DOI :
学科分类：物理（综合）
来源: Computer Science Journals
PDF


	文献评价指标
	下载次数：8次	浏览次数：23次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】