| ETRI Journal | |
| Frequency-Temporal Filtering for a Robust Audio Fingerprinting Scheme in Real-Noise Environments | |
| Keywords: temporal filtering; frequency filtering; audio fingerprint; music information retrieval | |
| Others: 1185391; DOI: 10.4218/etrij.06.0205.0135 |
【 Abstract 】
In real environments, sound recordings are commonly distorted by channel and background noise, which are the main causes of degraded audio identification performance. Recently, Philips introduced a robust and efficient audio fingerprinting scheme that applies differential (high-pass) filtering to the frequency-time sequence of the perceptual filter-bank energies. In practice, however, the robustness of this scheme remains an important concern in real environments. In this letter, we introduce alternative frequency-temporal filtering combinations that extend Philips’ audio fingerprinting scheme to achieve robustness to channel and background noise under real-world conditions. Our experimental results show that the proposed filtering combination improves noise robustness in audio identification.
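As a rough illustration of the differential filtering the abstract refers to, the sketch below derives fingerprint bits from the sign of a two-tap difference taken first across frequency bands and then across time frames, which is the baseline Philips scheme as described by Haitsma and Kalker. The function names, the toy energy values, and the use of log-domain energies in the offset check are assumptions made for illustration. The alternative filters proposed in the letter are not specified in the abstract and are not reproduced here.

```python
def fingerprint_bits(energies):
    """Sign of the frequency-then-time difference of filter-bank energies.

    energies: list of frames, each a list of band energies (frames x bands).
    Returns a (frames - 1) x (bands - 1) list of 0/1 bits.
    """
    bits = []
    for n in range(1, len(energies)):
        cur, prev = energies[n], energies[n - 1]
        row = []
        for m in range(len(cur) - 1):
            # Difference across adjacent bands (frequency filter), then
            # difference of that result across adjacent frames (temporal filter).
            d = (cur[m] - cur[m + 1]) - (prev[m] - prev[m + 1])
            row.append(1 if d > 0 else 0)
        bits.append(row)
    return bits


def bit_error_rate(a, b):
    """Fraction of differing bits between two equally sized fingerprints."""
    total = sum(len(row) for row in a)
    errors = sum(x != y for ra, rb in zip(a, b) for x, y in zip(ra, rb))
    return errors / total


# Toy example (hypothetical values): 3 frames, 3 bands.
clean = [[1.0, 2.0, 4.0], [3.0, 1.0, 0.5], [2.0, 2.0, 2.0]]

# Assumption: energies are in the log domain, so a fixed channel response
# appears as a constant per-band offset added to every frame.
offset = [5.0, -3.0, 2.0]
shifted = [[e + c for e, c in zip(frame, offset)] for frame in clean]

fp_clean = fingerprint_bits(clean)
fp_shifted = fingerprint_bits(shifted)
```

In this toy case the constant per-band offset cancels in the temporal difference, so both fingerprints are identical. Time-varying background noise does not cancel this way, which is the kind of distortion that a different filter choice would need to address.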