Frontiers in Neuroscience | 卷:15 |
The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Disyllabic Word Recognition | |
Shouhuan He1  Yang Guo3  Lili Xiao4  Zhong Zheng4  Chengqi Liu4  Yanmei Feng4  Xinrong Wang4  Keyi Li5  Gang Feng6  | |
[1] Department of Otolaryngology, Qingpu Branch of Zhongshan Hospital Affiliated to Fudan University, Shanghai, China; | |
[2] Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, China; | |
[3] Ear, Nose, and Throat Institute and Otorhinolaryngology Department, Eye and ENT Hospital of Fudan University, Shanghai, China; | |
[4] Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China; | |
[5] Sydney Institute of Language and Commerce, Shanghai University, Shanghai, China; | |
[6] The First Affiliated Hospital of Jinzhou Medical University, Jinzhou, China; | |
关键词: relative weight; envelope cues; frequency region; Mandarin Chinese; disyllabic word; | |
DOI : 10.3389/fnins.2021.670192 | |
来源: DOAJ |
【 摘 要 】
ObjectivesAcoustic temporal envelope (E) cues containing speech information are distributed across all frequency spectra. To provide a theoretical basis for the signal coding of hearing devices, we examined the relative weight of E cues in different frequency regions for Mandarin disyllabic word recognition in quiet.DesignE cues were extracted from 30 continuous frequency bands within the range of 80 to 7,562 Hz using Hilbert decomposition and assigned to five frequency regions from low to high. Disyllabic word recognition of 20 normal-hearing participants were obtained using the E cues available in two, three, or four frequency regions. The relative weights of the five frequency regions were calculated using least-squares approach.ResultsParticipants correctly identified 3.13–38.13%, 27.50–83.13%, or 75.00–93.13% of words when presented with two, three, or four frequency regions, respectively. Increasing the number of frequency region combinations improved recognition scores and decreased the magnitude of the differences in scores between combinations. This suggested a synergistic effect among E cues from different frequency regions. The mean weights of E cues of frequency regions 1–5 were 0.31, 0.19, 0.26, 0.22, and 0.02, respectively.ConclusionFor Mandarin disyllabic words, E cues of frequency regions 1 (80–502 Hz) and 3 (1,022–1,913 Hz) contributed more to word recognition than other regions, while frequency region 5 (3,856–7,562) contributed little.
【 授权许可】
Unknown