Journal Article Details
ETRI Journal
Frequency-Temporal Filtering for a Robust Audio Fingerprinting Scheme in Real-Noise Environments
Keywords: temporal filtering; frequency filtering; audio fingerprint; music information retrieval
Others  :  1185391
DOI  :  10.4218/etrij.06.0205.0135
【 Abstract 】

In real environments, sound recordings are commonly distorted by channel and background noise, which are the main causes of degraded audio identification performance. Recently, Philips introduced a robust and efficient audio fingerprinting scheme that applies a differential (high-pass) filter to the frequency-time sequence of the perceptual filter-bank energies. In practice, however, the robustness of this scheme in real environments remains an important concern. In this letter, we introduce alternative frequency-temporal filtering combinations as an extension of Philips' audio fingerprinting scheme, aiming at robustness to channel and background noise under real conditions. Our experimental results show that the proposed filtering combination improves noise robustness in audio identification.
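
For orientation, the differential filtering attributed to Philips in the abstract can be sketched in a few lines of Python. This is a minimal illustration of the sign-of-difference rule described by Haitsma and Kalker [3], not the alternative filter combination proposed in this letter, which the abstract does not specify. The function names, the `energies[frame][band]` layout, and the toy values are assumptions made for the example.

```python
from typing import List


def differential_fingerprint(energies: List[List[float]]) -> List[List[int]]:
    """Derive fingerprint bits from filter-bank energies.

    energies[n][m] is the energy of band m in frame n (assumed layout).
    A bit is 1 when the difference between adjacent bands in frame n
    exceeds the same difference in frame n - 1: a first-order difference
    along frequency followed by a first-order difference along time.
    """
    bits = []
    for n in range(1, len(energies)):
        cur, prev = energies[n], energies[n - 1]
        row = []
        for m in range(len(cur) - 1):
            freq_diff_cur = cur[m] - cur[m + 1]
            freq_diff_prev = prev[m] - prev[m + 1]
            row.append(1 if freq_diff_cur - freq_diff_prev > 0 else 0)
        bits.append(row)
    return bits


def bit_error_rate(a: List[List[int]], b: List[List[int]]) -> float:
    """Fraction of differing bits between two equally sized fingerprints."""
    total = sum(len(row) for row in a)
    errors = sum(x != y for ra, rb in zip(a, b) for x, y in zip(ra, rb))
    return errors / total


# Toy energies: 3 frames x 3 bands (made-up values).
clean = [[4.0, 2.0, 1.0], [3.0, 3.0, 0.5], [5.0, 1.0, 2.0]]

# Hypothetical stationary channel: a fixed offset per band in every frame,
# as a time-invariant channel would add if the energies were log energies.
offsets = [0.5, -0.2, 0.1]
shifted = [[e + o for e, o in zip(frame, offsets)] for frame in clean]

fp_clean = differential_fingerprint(clean)
fp_shifted = differential_fingerprint(shifted)
print(fp_clean)
print(bit_error_rate(fp_clean, fp_shifted))
```

In this toy case the per-band offsets cancel in the temporal difference, so both fingerprints are identical. Background noise that varies over time does not cancel this way, which is the kind of distortion the letter's filtering alternatives target.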

【 References 】
  • [1]Shazam Entertainment, http://www.shazam.com.
  • [2]Gracenote, http://www.gracenote.com.
  • [3]J. Haitsma and T. Kalker, "A Highly Robust Audio Fingerprinting System," Proc. ISMIR 2002, 2002, pp. 144-148.
  • [4]C. Burges, J. Platt, and S. Jana, "Distortion Discriminant Analysis for Audio Fingerprinting," IEEE Trans. Speech and Audio Processing, vol. 11, Mar. 2003, pp. 165-174.
  • [5]M. L. Miller, M. A. Rodriguez, and I. J. Cox, "Audio Fingerprinting: Nearest Neighbor Search in High-dimensional Binary Space," IEEE Multimedia Signal Processing Workshop, Dec. 2002, pp. 182-185.
  • [6]D. Kirovski and H. Attias, "Beat-ID: Identifying Music via Beat Analysis," IEEE Multimedia Signal Processing Workshop, Dec. 2002, pp. 190-193.
  • [7]M. K. Mihcak and R. Venkatesan, "A Perceptual Audio Hashing Algorithm: A Tool for Robust Audio Identification and Information Hiding," LNCS, vol. 2137, 2001, pp. 51-65.
  • [8]J. Chen, K. Paliwal, and S. Nakamura, "Cepstrum Derived from Differentiated Power Spectrum for Robust Speech Recognition," Speech Communication, vol. 41, Oct. 2003, pp. 469-484.
  • [9]H.-Y. Jung, "Filtering of Filter-Bank Energies for Robust Speech Recognition," ETRI J., vol. 26, no. 3, June 2004, pp. 273-276.
  • [10]C. Nadeu, D. Macho, and J. Hernando, "Time and Frequency Filtering of Filter-Bank Energies for Robust HMM Speech Recognition," Speech Communication, vol. 34, Apr. 2001, pp. 93-114.
  • [11]H. Hermansky et al., "Compensation for the Effect of the Communication Channel in the Auditory-Like Analysis of Speech (RASTA-PLP)," Proc. Eurospeech, 1991, pp. 1367-1370.