期刊论文详细信息
ETRI Journal
Interference Suppression Using Principal Subspace Modification in Multichannel Wiener Filter and Its Application to Speech Recognition
关键词: speech recognition;    microphone array;    subspace;    interference suppression;    Multichannel Wiener filter;   
Others  :  1185876
DOI  :  10.4218/etrij.10.0110.0045
PDF
【 摘 要 】

It has been shown that the principal subspace-based multichannel Wiener filter (MWF) provides better performance than the conventional MWF for suppressing interference in the case of a single target source. It can efficiently estimate the target speech component in the principal subspace which estimates the acoustic transfer function up to a scaling factor. However, as the input signal-to-interference ratio (SIR) becomes lower, larger errors are incurred in the estimation of the acoustic transfer function by the principal subspace method, degrading the performance in interference suppression. In order to alleviate this problem, a principal subspace modification method was proposed in previous work. The principal subspace modification reduces the estimation error of the acoustic transfer function vector at low SIRs. In this work, a frequency-band dependent interpolation technique is further employed for the principal subspace modification. The speech recognition test is also conducted using the Sphinx-4 system and demonstrates the practical usefulness of the proposed method as a front processing for the speech recognizer in a distant-talking and interferer-present environment.

【 授权许可】

   

【 预 览 】
附件列表
Files Size Format View
20150520115304692.pdf 830KB PDF download
【 参考文献 】
  • [1]M. Brandstein and D. Ward, Eds., Microphone Arrays, Springer-Verlag, 2001.
  • [2]D. Florencio and H. Malvar, "Multichannel Filtering for Optimum Noise Reduction in Microphone Arrays," Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process., Salt Lake City, UT, USA, May 2001, pp. 197-200.
  • [3]D. Florencio and H. Malvar, "Multichannel Filtering for Optimum Noise Reduction in Microphone Arrays," Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process., Salt Lake City, UT, USA, May 2001, pp. 197-200.
  • [4]A. Spriet, M. Moonen, and J. Wouters, "Robustness Analysis of Multichannel Wiener Filtering and Generalized Sidelobe Cancellation for Multimicrophone Noise Reduction in Hearing Aid Applications," IEEE Trans. Speech Audio Process., vol. 13, no. 4, July 2005, pp. 487-503.
  • [5]S. Doclo and M. Moonen, "Combined Frequency-Domain Dereverberation and Noise Reduction Technique for Multi-microphone Speech Enhancement," Int. Workshop Acoustic Echo Noise Control, Darmstadt, Germany, Sept. 2001, pp. 31-34.
  • [6]W. Herbordt, Sound Capture for Human/Machine Interfaces, Springer-Verlag, 2005.
  • [7]G. Kim and N.I. Cho, "Principal Subspace Modification for Multi-channel Wiener Filter in Multi-microphone Noise Reduction," Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process., 2008, pp. 4909-4912.
  • [8]G.H. Golub and C.F. Van Loan, Matrix Computations, Johns Hopkins University Press, 3rd ed., 1996.
  • [9]H. Van Trees, Detection, Estimation and Modulation Theory, Part IV: Optimum Array Processing, New York: Wiley, 2002.
  • [10]R.G. Leonard, "A Database for Speaker-Independent Digit Recognition," Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process., 1984, pp. 111-114.
  • [11]"RWCP Sound Scene Database in Real Acoustical Environments," Real World Computing Partnership, (c)1998-2001.
  • [12]The CMU Sphinx Group Open Source Speech Recognition Engines. Available: http://cmusphinx.sourceforge.net
  文献评价指标  
  下载次数:12次 浏览次数:19次