期刊论文详细信息
Journal of Computer Science
SPEECH/MUSIC CLASSIFICATION USING WAVELET BASED FEATURE EXTRACTION TECHNIQUES | Science Publications
P. Dhanalakshmi1  Thiruvengatanadhan Ramalingam1 
关键词: Audio Classification;    Feature Extraction;    Wavelet Transform;    Support Vector Machine (SVM);    Gaussian Mixture Model (GMM);   
DOI  :  10.3844/jcssp.2014.34.44
学科分类:计算机科学(综合)
来源: Science Publications
PDF
【 摘 要 】

Audio classification serves as the fundamental step towards the rapid growth in audio data volume. Due to the increasing size of the multimedia sources speech and music classification is one of the most important issues for multimedia information retrieval. In this work a speech/music discrimination system is developed which utilizes the Discrete Wavelet Transform (DWT) as the acoustic feature. Multi resolution analysis is the most significant statistical way to extract the features from the input signal and in this study, a method is deployed to model the extracted wavelet feature. Support Vector Machines (SVM) are based on the principle of structural risk minimization. SVM is applied to classify audio into their classes namely speech and music, by learning from training data. Then the proposed method extends the application of Gaussian Mixture Models (GMM) to estimate the probability density function using maximum likelihood decision methods. The system shows significant results with an accuracy of 94.5%.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201911300738502ZK.pdf 218KB PDF download
  文献评价指标  
  下载次数:4次 浏览次数:24次