学位论文详细信息
Speech Endpoint Detection: An Image Segmentation Approach
Speech Endpoint Detection;Speech Segmentation;Image Segmentation;Speech processing;Electrical and Computer Engineering
Faris, Nesma
University of Waterloo
关键词: Speech Endpoint Detection;    Speech Segmentation;    Image Segmentation;    Speech processing;    Electrical and Computer Engineering;   
Others  :  https://uwspace.uwaterloo.ca/bitstream/10012/7467/1/Faris_Nesma.pdf
瑞士|英语
来源: UWSPACE Waterloo Institutional Repository
PDF
【 摘 要 】

Speech Endpoint Detection, also known as Speech Segmentation, is an unsolved problem in speech processing that affects numerous applications including robust speech recognition. This task is not as trivial as it appears, and most of the existing algorithms degrade at low signal-to-noise ratios (SNRs). Most of the previous research approaches have focused on the development of robust algorithms with special attention being paid to the derivation and study of noise robust features and decision rules. This research tackles the endpoint detection problem in a different way, and proposes a novel speech endpoint detection algorithm which has been derived from Chan-Vese algorithm for image segmentation. The proposed algorithm has the ability to fuse multi features extracted from the speech signal to enhance the detection accuracy. The algorithm performance has been evaluated and compared to two widely used speech detection algorithms under various noise environments with SNR levels ranging from 0 dB to 30 dB. Furthermore, the proposed algorithm has also been applied to different types of American English phonemes. The experiments show that, even under conditions of severe noise contamination, the proposed algorithm is more efficient as compared to the reference algorithms.

【 预 览 】
附件列表
Files Size Format View
Speech Endpoint Detection: An Image Segmentation Approach 2837KB PDF download
  文献评价指标  
  下载次数:14次 浏览次数:20次