期刊论文详细信息
Acoustical Science and Technology
Effective speech suppression using a two-channel microphone array for privacy protection in face-to-face sales monitoring
Takashi Fukuda1  Ryuki Tachibana1  Osamu Ichikawa1 
[1] IBM Research
关键词: Microphone array;    Post-filtering;    Generalized cross-correlation;    Cross-power spectrum phase;    Beamformer;   
DOI  :  10.1250/ast.36.507
学科分类:声学和超声波
来源: Acoustical Society of Japan
PDF
【 摘 要 】

References(20)In the financial industry, face-to-face conversation is an essential for sales. Similar to call-center monitoring, there is a significant need to monitor the conversation for compliance checks. In certain business scenarios, there is a need to record an employee's speech while protecting the customers' confidentiality and privacy. In this paper, we propose a small-scale microphone array system specially designed to record only the agent's speech. For the suppression of the customer's speech, we used CSP-based post-filtering. However, using small number of microphones, it is difficult to suppress unwanted speech completely. Because post-filtering using correlations of the multiple channels often affected by the spatial aliasing between speakers. We introduced the weighted-CSP to attenuate susceptible bins to the interfering speaker. Also we introduced flooring after the post-filtering to mask residuals. This combination helps prevent the customer's speech to be transcribed.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201912080715981ZK.pdf 1440KB PDF download
  文献评价指标  
  下载次数:10次 浏览次数:29次