期刊论文详细信息
Acoustical Science and Technology
CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments
Norihide Kitaoka10  Yuki Denda3  Satoru Tsuge2  Takanobu Nishiura3  Masakiyo Fujimoto1  Satoshi Nakamura4,12  Tetsuya Takiguchi4  Masato Nakayama3  Takeshi Yamada4,9  Chiyomi Miyajima10  Shigeki Matsuda8  Kazuya Takeda10  Shingo Kuroiwa11  Kazumasa Yamamoto5  Satoshi Tamura6  Tetsuji Ogawa4,7 
[1] NTT Communication Science Laboratories, NTT Corporation, “Keihanna Science City,”;The University of Tokushima;Ritsumeikan University;Kobe University;Toyohashi University of Technology;Gifu University;Waseda University;ATR Spoken Language Communication Research Labs. and National Institute of Information and Communications Technology, “Keihanna Science City,”;University of Tsukuba;Nagoya University;Chiba University and National Institute of Information and Communications Technology;National Institute of Information and Communications Technology, “Keihanna Science City,”
关键词: Voice activity detection;    Noisy speech processing;    Evaluation framework;   
DOI  :  10.1250/ast.30.363
学科分类:声学和超声波
来源: Acoustical Society of Japan
PDF
【 摘 要 】

References(16)Cited-By(5)Voice activity detection (VAD) plays an important role in speech processing including speech recognition, speech enhancement, and speech coding under noisy environments. We have developed an evaluation framework for VAD under noisy environments, named CENSREC-1-C. We designed this framework for simple isolated utterance detection and hence, this framework consists of noisy continuous digit utterances and evaluation tools for VAD results. We define two evaluation measures, one for frame-level detection performance and the other for utterance-level detection performance. We also provide the evaluation results of a power-based VAD method as a reference.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201912080715717ZK.pdf 664KB PDF download
  文献评价指标  
  下载次数:18次 浏览次数:68次