期刊论文详细信息
IEEE Access
Cluster Analysis for the Separation of Auditory Scenes
Lia M. Bonacci1  Jeffrey B. Bolkhovsky1  David H. Gever1  Matthew S. Daley1  Krystina Diaz1 
[1] Naval Submarine Medical Research Laboratory, Groton, CT, USA;
关键词: Biomedical signal processing;    clustering algorithms;    machine learning;    machine learning algorithms;    pattern clustering;    signal processing algorithms;   
DOI  :  10.1109/ACCESS.2021.3113615
来源: DOAJ
【 摘 要 】

The “cocktail party problem” refers to the ability of human listeners to separate the acoustic signal reaching their ears into its individual components, corresponding to individual sound sources in the environment. Despite this phenomenon appearing trivial for humans, solving the cocktail party problem computationally remains an ambitious challenge. The approach used in this paper takes inspiration from human strategies for separating an acoustic environment into distinct perceptual auditory streams. A series of time-frequency-based features, analogous to those thought to emerge at various stages in the human auditory processing pathway, are derived from biaural auditory inputs. These feature vectors are used as inputs to an unsupervised cluster analysis used to group feature values that are assumed to correspond to the same object. Reconstructed auditory streams are then correlated to the original components used to create the auditory scene. Our model is capable of reconstructing streams that correlate to the original components (r = 0.3-0.7) used to create the complex auditory scene. The success of the reconstructions is largely dependent on the signal-to-noise ratio of the components of the auditory scene.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次