期刊论文

【摘要】

In a real environment, sound recordings are commonly distorted by channel and background noise, and the performance of audio identification is mainly degraded by them. Recently, Philips introduced a robust and efficient audio fingerprinting scheme applying a differential (high-pass filtering) to the frequency-time sequence of the perceptual filter-bank energies. In practice, however, the robustness of the audio fingerprinting scheme is still important in a real environment. In this letter, we introduce alternatives to the frequency-temporal filtering combination for an extension method of Philips’ audio fingerprinting scheme to achieve robustness to channel and background noise under the conditions of a real situation. Our experimental results show that the proposed filtering combination improves noise robustness in audio identification.

【授权许可】

【预览】

附件列表
Files	Size	Format	View
20150520110901223.pdf	309KB	PDF	download

【参考文献】

[1]Shazam Entertainment, http://www.shazam.com.
[2]Gracenote, http://www.gracenote.com.
[3]J. Haitsma and T. Kalker, "A Highly Robust Audio Fingerprinting System," Proc. ISMIR 2002, 2002, pp. 144-148.
[4]C. Burges, J. Platt, and S. Jana, "Distortion Discriminant Analysis for Audio Fingerprinting," IEEE Trans. Speech and Audio Processing, vol. 11, Mar. 2003, pp. 165-174.
[5]M. L. Miller, M. A. Rodriguez, and I. J. Cox, "Audio Fingerprinting: Nearest Neighbor Search in High-dimensional Binary Space," IEEE Multimedia Signal Processing Workshop, Dec. 2002, pp. 182-185.
[6]D. Kirovski and H. Attias, "Beat-ID: Identifying Music via Beat Analysis," IEEE Multimedia Signal Processing Workshop, Dec. 2002, pp. 190-193.
[7]M. K. Mihcak and R. Venkatesan, "A Perceptual Audio Hashing Algorithm:?A Tool for Robust Audio Identification and Information Hiding," LNCS, vol. 2137, 2001, pp. 51-65.
[8]J. Chen, K. Paliwal, and S. Nakamura, "Cepstrum Derived from Differentiated Power Spectrum for Robust Speech Recognition," Speech Communication, vol. 41, Oct. 2003, pp. 469-484.
[9]H.-Y. Jung, "Filtering of Filter-Bank Energies for Robust Speech Recognition," ETRI J., vol. 26, no. 3, June 2004, pp. 273-276.
[10]C. Nadeu, D. Macho, and J. Hernando, "Time and Frequency Filtering of Filter-Bank Energies for Robust HMM Speech Recognition," Speech Communication, vol. 34, Apr. 2001, pp. 93-114.
[11]H. Hermansky et al., "Compensation for the Effect of the Communication Channel in the Auditory-Like Analysis of Speech (RASTA-PLP)," Proc. Eurospeech, 1991, pp. 1367-1370.

ETRI Journal
Frequency-Temporal Filtering for a Robust Audio Fingerprinting Scheme in Real-Noise Environments


关键词: temporal filtering; frequency filtering; audio fingerprint; Music information retrieval;
Others : 1185391 DOI : 10.4218/etrij.06.0205.0135

PDF


	文献评价指标
	下载次数：14次	浏览次数：16次

【 摘 要 】

【 授权许可】

【 预 览 】

【 参考文献 】

【摘要】

【授权许可】

【预览】

【参考文献】