| EURASIP Journal on Audio, Speech, and Music Processing | 卷:2023 |
| Voice activity detection in the presence of transient based on graph | |
| Empirical Research | |
| Chun-Xian Gao1  Hui Liu1  Xiao-Yuan Guo1  | |
| [1] Department of Information and Communication Engineering, Xiamen University, Xiamen, China; | |
| 关键词: Voice activity detection; Transients; Time series complex networks; Nonlinear dynamic characteristics; | |
| DOI : 10.1186/s13636-023-00282-x | |
| received in 2022-06-23, accepted in 2023-03-25, 发布年份 2023 | |
| 来源: Springer | |
PDF
|
|
【 摘 要 】
Voice activity detection remains a significant challenge in the presence of transients since transients are more dominant than speech, though it has achieved satisfactory performance in quasi-stationary noisy environments. This paper studies the differences between speech and transients in nonlinear dynamic characteristics and proposes a new method for accurately detecting speech and transients. Limited by algorithm complexity, previous research has proposed few detectors to model speech and transients based on contextual information and thus failing to detect transient frames accurately. To address this challenge, our study proposes to map features of audio signals to a time series complex network, a kind of graph data, analyzed by the Laplacian and adjacency matrix of graphs, then classified by the support vector machine (SVM) classifier. The proposed algorithm can analyze a more extended speech period, allowing the full utilization of contextual information of preceding and following frames. The experimental results show that the performance of this method has obvious superiority over other existing algorithms.
【 授权许可】
CC BY
© The Author(s) 2023
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO202304222207119ZK.pdf | 5680KB | ||
| MediaObjects/41522_2023_387_MOESM1_ESM.pdf | 7068KB | ||
| Fig. 6 | 207KB | Image | |
| Fig. 3 | 277KB | Image | |
| 467KB | Image | ||
| 615KB | Image | ||
| 13065_2022_881_Figc_HTML.gif | 138KB | Image | |
| MediaObjects/41420_2022_1011_MOESM32_ESM.jpg | 706KB | Other | |
| 13065_2022_881_Figf_HTML.gif | 64KB | Image | |
| MediaObjects/12870_2022_3891_MOESM2_ESM.xlsx | 23KB | Other | |
| 13065_2022_881_Figi_HTML.gif | 163KB | Image | |
| Fig. 3 | 832KB | Image | |
| Fig. 3 | 613KB | Image | |
| Fig. 2 | 146KB | Image | |
| Fig. 1 | 165KB | Image | |
| Fig. 6 | 84KB | Image | |
| Fig. 2 | 1268KB | Image | |
| 41535_2022_516_Article_IEq11.gif | 1KB | Image | |
| Fig. 1 | 591KB | Image | |
| MediaObjects/40249_2023_1079_MOESM1_ESM.docx | 48KB | Other | |
| Fig. 1 | 1052KB | Image | |
| Fig. 4 | 488KB | Image | |
| Fig. 2 | 330KB | Image | |
| 41535_2022_516_Article_IEq16.gif | 1KB | Image | |
| Fig. 9 | 498KB | Image | |
| Fig. 4 | 148KB | Image | |
| Fig. 4 | 185KB | Image | |
| Fig. 3 | 474KB | Image | |
| MediaObjects/12888_2022_4282_MOESM1_ESM.docx | 35KB | Other | |
| Fig. 10 | 86KB | Image | |
| MediaObjects/12888_2022_4282_MOESM2_ESM.docx | 127KB | Other | |
| Fig. 3 | 2081KB | Image | |
| MediaObjects/12888_2022_4282_MOESM3_ESM.docx | 12KB | Other | |
| MediaObjects/12888_2022_4282_MOESM4_ESM.docx | 53KB | Other | |
| MediaObjects/40560_2023_664_MOESM1_ESM.pdf | 582KB | ||
| MediaObjects/12888_2022_4282_MOESM5_ESM.docx | 43KB | Other | |
| Fig. 1 | 172KB | Image | |
| MediaObjects/12888_2022_4282_MOESM6_ESM.docx | 52KB | Other | |
| 41535_2022_516_Article_IEq31.gif | 1KB | Image | |
| MediaObjects/12888_2022_4282_MOESM7_ESM.docx | 52KB | Other | |
| Fig. 1 | 318KB | Image | |
| 41535_2022_516_Article_IEq34.gif | 1KB | Image | |
| Fig. 13 | 91KB | Image | |
| Fig. 1 | 88KB | Image | |
| Fig. 3 | 790KB | Image | |
| Fig. 2 | 1441KB | Image | |
| Fig. 8 | 224KB | Image | |
| Fig. 4 | 735KB | Image | |
| 10194_2023_1579_Article_IEq1.gif | 1KB | Image | |
| Fig. 9 | 120KB | Image | |
| MediaObjects/13063_2023_7164_MOESM2_ESM.pdf | 174KB | ||
| Fig. 1 | 2416KB | Image | |
| Fig. 14 | 335KB | Image | |
| MediaObjects/13063_2023_7164_MOESM3_ESM.pdf | 911KB | ||
| Fig. 10 | 66KB | Image | |
| Fig. 4 | 1253KB | Image | |
| Fig. 2 | 311KB | Image | |
| Fig. 11 | 64KB | Image | |
| Fig. 7 | 223KB | Image | |
| Fig. 15 | 81KB | Image | |
| 10194_2023_1579_Article_IEq13.gif | 1KB | Image | |
| Fig. 8 | 456KB | Image | |
| MediaObjects/41408_2023_829_MOESM1_ESM.pdf | 1275KB | ||
| Table 6 | 249KB | Table | |
| Fig. 5 | 755KB | Image | |
| 10194_2023_1579_Article_IEq18.gif | 1KB | Image | |
| Table 7 | 99KB | Table | |
| MediaObjects/42004_2023_867_MOESM1_ESM.pdf | 854KB | ||
| MediaObjects/13750_2022_285_MOESM1_ESM.xlsx | 36KB | Other | |
| 10194_2023_1579_Article_IEq22.gif | 1KB | Image | |
| 10194_2023_1579_Article_IEq24.gif | 1KB | Image | |
| Fig. 5 | 1544KB | Image | |
| 12942_2022_316_Article_IEq4.gif | 1KB | Image | |
| 10194_2023_1579_Article_IEq26.gif | 1KB | Image | |
| Fig. 1 | 113KB | Image | |
| MediaObjects/42004_2023_867_MOESM2_ESM.pdf | 71KB | ||
| 10194_2023_1579_Article_IEq29.gif | 1KB | Image | |
| MediaObjects/42004_2023_867_MOESM3_ESM.xlsx | 54KB | Other | |
| 10194_2023_1579_Article_IEq31.gif | 1KB | Image | |
| MediaObjects/42004_2023_867_MOESM4_ESM.xlsx | 44KB | Other | |
| 12942_2022_316_Article_IEq12.gif | 1KB | Image | |
| 40507_2023_167_Article_IEq56.gif | 1KB | Image |
【 图 表 】
40507_2023_167_Article_IEq56.gif
12942_2022_316_Article_IEq12.gif
10194_2023_1579_Article_IEq31.gif
10194_2023_1579_Article_IEq29.gif
Fig. 1
10194_2023_1579_Article_IEq26.gif
12942_2022_316_Article_IEq4.gif
Fig. 5
10194_2023_1579_Article_IEq24.gif
10194_2023_1579_Article_IEq22.gif
10194_2023_1579_Article_IEq18.gif
Fig. 5
Fig. 8
10194_2023_1579_Article_IEq13.gif
Fig. 15
Fig. 7
Fig. 11
Fig. 2
Fig. 4
Fig. 10
Fig. 14
Fig. 1
Fig. 9
10194_2023_1579_Article_IEq1.gif
Fig. 4
Fig. 8
Fig. 2
Fig. 3
Fig. 1
Fig. 13
41535_2022_516_Article_IEq34.gif
Fig. 1
41535_2022_516_Article_IEq31.gif
Fig. 1
Fig. 3
Fig. 10
Fig. 3
Fig. 4
Fig. 4
Fig. 9
41535_2022_516_Article_IEq16.gif
Fig. 2
Fig. 4
Fig. 1
Fig. 1
41535_2022_516_Article_IEq11.gif
Fig. 2
Fig. 6
Fig. 1
Fig. 2
Fig. 3
Fig. 3
13065_2022_881_Figi_HTML.gif
13065_2022_881_Figf_HTML.gif
13065_2022_881_Figc_HTML.gif
Fig. 3
Fig. 6
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]
PDF