期刊论文详细信息
Acoustical Science and Technology
Multistream sparse representation features for noise robust audio-visual speech recognition
Satoru Hayamizu1  Peng Shen2  Satoshi Tamura1 
[1] Faculty of Engineering, Gifu University;Graduate School of Engineering, Gifu University
关键词: Audio-visual speech recognition;    Sparse representation;    Noise reduction;    Joint sparsity model;   
DOI  :  10.1250/ast.35.17
学科分类:声学和超声波
来源: Acoustical Society of Japan
PDF
【 摘 要 】

References(22)In this paper, we propose to use exemplar-based sparse representation features for noise robust audio-visual speech recognition. First, we introduce a sparse representation technology and describe how noise robustness can be realized by the sparse representation for noise reduction. Then, feature fusion methods are proposed to combine audio-visual features with the sparse representation. Our work provides new insight into two crucial issues in automatic speech recognition: noise reduction and robust audio-visual features. For noise reduction, we describe a noise reduction method in which speech and noise are mapped into different subspaces by the sparse representation to reduce the noise. Our proposed method can be deployed not only on audio noise reduction but also on visual noise reduction for several types of noise. For the second issue, we investigate two feature fusion methods –- late feature fusion and the joint sparsity model method –- to calculate audio-visual sparse representation features to improve the accuracy of the audio-visual speech recognition. Our proposed method can also contribute to feature fusion for the audio-visual speech recognition system. Finally, to evaluate the new sparse representation features, a database for audio-visual speech recognition is used in this research. We show the effectiveness of our proposed noise reduction on both audio and visual cases for several types of noise and the effectiveness of audio-visual feature determination by the joint sparsity model, in comparison with the late feature fusion method and traditional methods.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201912080715896ZK.pdf 768KB PDF download
  文献评价指标  
  下载次数:11次 浏览次数:7次