期刊论文详细信息
Frontiers in Psychology
Linking Speech Perception and Neurophysiology: Speech Decoding Guided by Cascaded Oscillators Locked to the Input Rhythm
Oded Ghitza1 
关键词: speech perception;    memory access;    decoding time;    brain rhythms;    cascaded cortical oscillations;    phase locking;    parsing;    decoding;   
DOI  :  10.3389/fpsyg.2011.00130
学科分类:心理学(综合)
来源: Frontiers
PDF
【 摘 要 】

The premise of this study is that current models of speech perception, which are driven by acoustic features alone, are incomplete, and that the role of decoding time during memory access must be incorporated to account for the patterns of observed recognition phenomena. It is postulated that decoding time is governed by a cascade of neuronal oscillators, which guide template-matching operations at a hierarchy of temporal scales. Cascaded cortical oscillations in the theta, beta, and gamma frequency bands are argued to be crucial for speech intelligibility. Intelligibility is high so long as these oscillations remain phase locked to the auditory input rhythm. A model (Tempo) is presented which is capable of emulating recent psychophysical data on the intelligibility of speech sentences as a function of “packaging” rate (Ghitza and Greenberg, 2009). The data show that intelligibility of speech that is time-compressed by a factor of 3 (i.e., a high syllabic rate) is poor (above 50% word error rate), but is substantially restored when the information stream is re-packaged by the insertion of silent gaps in between successive compressed-signal intervals – a counterintuitive finding, difficult to explain using classical models of speech perception, but emerging naturally from the Tempo architecture.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO201901225308806ZK.pdf 2259KB PDF download
  文献评价指标  
  下载次数:12次 浏览次数:11次