Acoustical Science and Technology | |
CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments | |
Norihide Kitaoka10  Yuki Denda3  Satoru Tsuge2  Takanobu Nishiura3  Masakiyo Fujimoto1  Satoshi Nakamura4,12  Tetsuya Takiguchi4  Masato Nakayama3  Takeshi Yamada4,9  Chiyomi Miyajima10  Shigeki Matsuda8  Kazuya Takeda10  Shingo Kuroiwa11  Kazumasa Yamamoto5  Satoshi Tamura6  Tetsuji Ogawa4,7  | |
[1] NTT Communication Science Laboratories, NTT Corporation, “Keihanna Science City,”;The University of Tokushima;Ritsumeikan University;Kobe University;Toyohashi University of Technology;Gifu University;Waseda University;ATR Spoken Language Communication Research Labs. and National Institute of Information and Communications Technology, “Keihanna Science City,”;University of Tsukuba;Nagoya University;Chiba University and National Institute of Information and Communications Technology;National Institute of Information and Communications Technology, “Keihanna Science City,” | |
关键词: Voice activity detection; Noisy speech processing; Evaluation framework; | |
DOI : 10.1250/ast.30.363 | |
学科分类:声学和超声波 | |
来源: Acoustical Society of Japan | |
【 摘 要 】
References(16)Cited-By(5)Voice activity detection (VAD) plays an important role in speech processing including speech recognition, speech enhancement, and speech coding under noisy environments. We have developed an evaluation framework for VAD under noisy environments, named CENSREC-1-C. We designed this framework for simple isolated utterance detection and hence, this framework consists of noisy continuous digit utterances and evaluation tools for VAD results. We define two evaluation measures, one for frame-level detection performance and the other for utterance-level detection performance. We also provide the evaluation results of a power-based VAD method as a reference.
【 授权许可】
Unknown
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201912080715717ZK.pdf | 664KB | download |